GENE CLUSTER FOR PRODUCTION OF THE ENEDIYNE 
ANTITUMOR ANTIBIOTICC-1027 

SEQUENCE LISTING 

SEQ ID No. 1. C-1027 gene cluster DNAVsequence from 1 to 42,000, ORF-(-7) to ORF- 
26 \ 

gtcgactctagaggatcccgggtgcggKgtaggggttacggacgaaggaggggtgcccgg 

1 + + \- + + + + 60 

CAGCTGAGATCTCCTAGGGCCCACGCCTCATCCCCAATGCCTGCTTCCTCCCCACGGGCC 
-7_* *LIGPA3YPNRVFSPHG 

CGACGCCTGCGGCGAAGGGCGGTTCCTTXjAGTTCGAGGCCGGTGGCGAGGACGACGTGGT 

61 + + Y + + + + 120 

GCTGCGGACGCCGCTTCCCGCCAAGGAACTCAAGCTCCGGCCACCGCTCCTGCTGCACCA 
-7 AVGAAFPPEKlLELGTALVVH 

CCGCGTCGAGGATCTGCGTGTCGGGGAGCIGGCCCAGGGCGCAGCCCCTCGGTCAGGTACG 
121 + + 4+ + + + 180 

ggcgcagctcctagacgcacagcccctcqccgggtcccgcgtcggggagccagtccatgc 
-7 dadli qtd plIpgprlgetly 

gggtgaggcccctgacggtcacctcgaagqagcggtcgtgggaccgggcgtcgagcgcct 

181 + + +V + + + 240 

cccactccggggactgccagtggagcttcgtcgccagcaccctggcccgcagctcgcgga 
-7 ptlgrvtve fqrdhsradla 

ccccgtccgcttccacaaggacgacgccgggAcaggactcccgtgcggcctcgaccagtc 

241 + + + - -V -- + + + 300 

GGGGCAGGCGAAGGTGTTCCTGCTGCGGCCCTffiTCCTGAGGGCACGCCGGAGCTGGTCAG 
-7 E GDAEVLV.VG p\t S E RAAEVL 

GGGCGTCGAGGTAGTCCTGGAAGATGCGGCGGGGGGCGGGGCCCTGTTCGGTGAACTTCC 

301 + + + -\ + + + 360 

CCCGCAGCTCCATCAGGACCTTCTACGCCGCCCCCCGCCCCGGGACAAGCCACTTGAAGG 
RAD L YDQ F I RR PI A P GQ E T F K 

ACGAAGCCCAGCGCCGGGGCCAGTCGCGCCGGTCQGCCTCCTGGTTGGCCCAGTTGATGA 

361 + + + + + + 420 

TGCTTCGGGTCGCGGCCCCGGTCAGCGCGGCCAGC*GGAGGACCAACCGGGTCAACTACT 
WSAWRRPWDRRDIAEQNAWNI 

AGTCGAGCACGTCCTCGCGGAACACCGACATCCTGCCGGCCTGGATATTGAAGACGTGGT 

421 + + + i- - + + + 480 

TCAGCTCGTGCAGGAGCGCCTTGTGGCTGTAGGACGGCCGGACCTATAACTTCTGCACCA 
FDLVDERFVSMRGAQINFVH 

CCCAGGGGTTGCCGTCACGGTGATAGGCGACGCCGGCXIGAGCGGTAgGCGGCGCGCCGCT 

481 + + + V- + + + 540 

GGGTCCCCAACGGCAGTGCCACTATCCGCTGCGGCCGffiCTCGCCATcCGCCGCGCGGCGA 
DWPNGDRHYAVGAlSRYAARR 

CCAGGAGGACGACTTCCAGCGGTCTTCTCGCGAAATGiw^GCAGGCGTATCGCGGTCGCCG 

541 + + + 1 + + + 600 

GGTCCTCCTGCTGAAGGTCGCCAGAAGAGCGCTTTACTTCGTCCGCATAGCGCCAGCGGC 
ELLVVELPRRAFH1LLR IATA 

TGCCTGCC^GGCCCGCCCCTACGACCAGCACCCTGGGGCJpCGCACCCGTCATGCCCATGA 

60 1 + + + V + + 660 

ACGGACGGTCCGGGCGGGGATGCTGGTCGTGGGACCCCGCGCGTGGGCAGTACGGGTACT 
TGALGAGVVLVRPRAGTMGM 



1 




AGCCTCCCCCGCTGACTCAGGGCGgCGCGTCGCGCGCTCCCGTCGGTGTCCTCGCTGACT 

661 + + + + + + 720 

TCGGAGGGGGCGACTGAGTCCCGCcGCGCAGCGCGCGAGGGCAGCCACAGGAGCGACTGA 

GGAAGTTCCCTGACCTGGCGTCAACTCCACTGATCCGTAAGGGGATCGCGGGAGTGGATA 

721 + + + + + + 780 

CCTTCAAGGGACTGGACCGCAGTTGAGGTGACTAGGCATTCCCCTAGCGCCCTCACCTAT 

CGGGTCAGGTCGTGCACGATCGTGGCACCAGACAGATCACCACGTCGATAGGCACTCGTG 

781 + + + + + + 840 

GCCCAGTCCAGCACGTGCTAGCACCGTGGTCTGTCTAGTGGTGCAGCTATCCGTGAGCAC 

AGCCGCGCCCGGGGCTCGACGGGGCGGGGCACCGGCAGGGGCGGCCGCGTGATCAGCCGG 

841 + + + + + + 900 

TCGGCGCGGGCCCCGAGCTGCCCCGCCCCGTGGCCGTCCCCGCCGGCGCACTAGTCGGCC 

AGCCTGTCCGGGGGCGTGCGTGCGGGGCGTCAGCTGTCGATGTCGGGAACGCCAGGGACG 

901 + + + + + + 960 

TCGGACAGGCCCCCGCACGCACGCCCCGCAGTCGACAGCTACAGCCCTTGCGGTCCCTGC 
-6-* *SDIDPVGPV- 

TCGATCTCGGTGCGGGCGTAGTGGTTGAAGTAGTTGGTGTAGAGGTTCACGGCCACGTGG 

961 + + + + + + 1020 

AGCTAGAGCCACGCCCGCATCACCAACTTCATCAACCACATCTCCAAGTGCCGGTGCACC 
-6 DIETRAYHNFYNTYLNVAVH- 

ACGAAGACCTCGGCGAGCTCGGTGTCCGTCCATCCCTGTGCCACGGCCGCGTTCCACGAG 

1021 + + + + + + 1080 

TGCTTCTGGAGCCGCTCGAGCCACAGGCAGGTAGGGACACGGTGCCGGCGCAAGGTGCTC 
-6 VFVEALETDTWGQAVAANWS- 

GCGTCAGACGCCTCGCCCACTTCGCCGGCGATCTCCCTGGCCACCTGGACCAGTGCTTCG 

1081 + + + + + + 1140 

CGCAGTCTGCGGAGCGGGTGAAGCGGCCGCTAGAGGGACCGGTGGACCTGGTCACGAAGC 
-6 ADSAEGVEGAI ERAVQVLAE 

AGCTTCACGTCGTCGCCGGGCGTCCCCCGGCGAATCGCCACGGTCTCCTCCAGCGTGAAA 

1141 + + + + + + 1200 

TCGAAGTGCAGCAGCGGCCCGCAGGGGGCCGCTTAGCGGTGCCAGAGGAGGTCGCACTTT 
-6 LKVDDGPTGRRIAVTEELTF- 

CCCGCGACCTTCGCCGACACCGTGTGCGCCGCCTGGCAGTACGCGCACGCGTCGACCGCG 

1201 + + + + + + 1260 

GGGCGCTGGAAGCGGCTGTGGCACACGCGGCGGACCGTCATGCGCGTGCGCAGCTGGCGC 
-6 GAVKASVTHAAQCYACADVA- 

CCCACGGCGAGGGCGATCGCCTCGCGTGTGCGGGCGTCGAACGTTCCATGTTCGGCGACG 

1261 + + + + + + 1320 

GGGTGCCGCTCCCGCTAGCGGAGCGCACACGCCCGCAGCTTGCAAGGTACAAGCCGCTGC 
-6 GVALAIAERTRADFTGHEAV- 

GCTCCGGTGATCGCGGCGTAGGTTTCCAGGACCACGGGGGAATGGGCCATTCCCCCGTGG 

1321 + + + + + + 1380 

CGAGGCCACTAGCGCCGCATCCAAAGGTCCTGGTGCCCCCTTACCCGGTAAGGGGGCACC 
-6 AGTIAAYTELVVPSHAMGGH 

ATGTTGAGCACTCGCCCGAACCGCTTCTCCAGTCGGCGCAGGATGTCTCCGCCGGCTGCG 

1381 + + + + + + 1440 

TACAACTCGTGAGCGGGCTTGGCGAAGAGGTCAGCCGCGTCCTACAGAGGCGGCCGACGC 
-6 INLVRGFRKELRRLIDGGAA- 

GGTGCGGTGTCGATGGTGTGGACGGGAATCCGCGGCATGGGAATGCCTCTCCTCGTAGTG 

1441 + + + + + + 1500 

CCACGCC7VCAGCTACCACACCAGCCCTTAGGCGCCGTACCCTTACGGAGAGGAGCATCAC 
-6-< PATDITHVPIRPM 
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ATGGGAGTTCCTCGTCCCTCCAGTCTGCCCAAGCACCTCCCCCGGTGAGCTGTCCCGGCC 

1501 + + + + + + 1560 

TACCCTCAAGGAGCAGGGAGGTCAGACGGGTTCGTGGAGGGGGCCACTCGACAGGGCCGG 

GCCCTCCGGCCCCTTCTAGGCAGGTCGCCCGGTGGTGCGGCCCCAGGACGTCACCTCGCC 

1561 + + + + + + 1620 

CGGGAGGCCGGGGAAGATCCGTCCAGCGGGCCACCACGCCGGGGTCCTGCAGTGGAGCGG 

GCACCACCGGGAGCCCCGAGGGGCGAGGTCAGAGGCCGAGCACCTCCTCGGCCAGGGCGG 

1621 + + + + + + 1680 

CGTGGTGGCCCTCGGGGCTCCCCGCTCCAGTCTCCGGCTCGTGGAGGAGCCGGTCCCGCC 
5-* * LGLVEEALA 

TGCCCCGAACACGGGCCTCGATCTTGGCGAAGGCCAGGTCGCGTGTGGTGGAGGTGTCGT 

1681 + + + + + + 1740 

ACGGGGCTTGTGCCCGGAGCTAGAACCGCTTCCGGTCCAGCGCACACCACCTCCACAGCA 
5 TGRVRAE I KAFALDRTTSTD 

CGGCGAACGGGGAGAAGCCGCAGTCGTCGCAGGTTCCCAGTTGCTCGACGGGGATGTAGC 

1741 + + + + + + 1800 

GCCGCTTGCCCCTCTTCGGCGTCT^GCAGCGTCCAAGGGTCAACGAGCTGCCCCTACATCG 
5 DAFPSFGCDDCTGLQEVPIY 

GGGCGGCGAGCAGGATGCGGTCGCGTACCTGCTCGGGGGTCTCGACCACTGGGTCGATCG 

1801 + + + + + + I860 

CCCGCCGCTCGTCCTACGCCAGCGCATGGACGAGCCCCCAGAGCTGGTGACCCAGCTAGC 
5 RAALLI RDRVQE PTEVVPD I 

GGTCGGTCACCCCGAGGAAGACGCGGGCGGCAGGGGGCAGGTGGTCACGGACGATGCTCA 

1861 + + ---- + + + + 1920 

CCAGCCAGTGGGGCTCCTTCTGCGCCCGCCGTCCCCCGTCCACCAGTGCCTGCTACGAGT 
5 PDTVGLFVRAAPPLHDRVIS 

GGACCCGCTCGGGGTCCGCTTCGCCGGCCAGTTCGAGATAGAAGTTGCCCGCCTTGAGCT 

1921 + + + + + + 1980 

CCTGGGCGAGCCCCAGGCGAAGCGGCCGGTCAAGCTCTATCTTCAACGGGCGGAACTCGA 
5 LVREPDAEGALELYFNGAKL 

GGAAGAGCTTGGGCAGCAGTTCGGCGTAGTCGATGTCGAGGCTGTGCGTGGAGTCCTGGT 

1981 + + + + + + 2040 

CCTTCTCGAACCCGTCGTCAAGCCGCATCAGCTACAGCTCCGACACGCACCTCAGGACCA 
5 QFLKPLLEAYDIDLSHTSDQ 

CGCCGCCGGGGCAGGTGTGTACGCCGATGCGGGCGGTTTCCTCGGCGCTGAAGCGCCCCA 

2041 + + + + + + 2100 

GCGGCGGCCCCGTCCACACATGCGGCTACGCCCGCCAAAGGAGCCGCGACTTCGCGGGGT 
5 DGGPCTHVG I RATEEAS FRG 

GGACTTCGTTGTTGAGGGCGATGAAGTCGTCGAGGACGCCGCCGCTGGGGTCGAGCTTGA 

2101 + + + + + + 2160 

CCTGAAGCAACAACTCCCGCTACTTCAGCAGCTCCTGCGGCGGCGACCCCAGCTCGAACT 
5 LVENNLAI FDDLVGGS PDLK 

GGGACAGCCGCCCCTCGGTGAAGTCGAGCTGGACCACGTGTGCCCCCGCGTCCAGGCAGC 

2161 + + + + + + 2220 

CCCTGTCGGCGGGGAGCCACTTCAGCTCGACCTGGTGCACACGGGGGCGCAGGTCCGTCG 
5 LSLRGETFDLQVVHAGADLC 

CTCGGATGTCGGCTTCGGCCTCGTCGGCGAGGTCGCGCAGGAACTGCTCGCGGGGGTAGC 

2221 + + + + + + 2280 

GAGCCTACAGCCGAAGCCGGAGCAGCCGCTCCAGCGCGTCCTTGACGAGCGCCCCCATCG 
5 GRIDAEAEDALDRLFQERPY 

CCTCGATGGGAGTGGCGGGGTAGAGGAGGCTGAGGGCGGAGGGTGCGATGACCGCCTGCT 
2281 + + + + + + 2340 
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GGAGCTACCCTCACCGCCCCATCTCCTCCGACTCCCGCCTCCCACGCTACTGGCGGACGA 
-5 GE I PTAPYLLSLAS PAIVAQ 

TCAGGGGGCGGTCCGTGAGCTGCCGTGCGGCGCGCAGATAGGTTTCGGCCCGCACCTGGT 

2341 + + + + + + 2400 

AGTCCCCCGCCAGGCACTCGACGGCACGCCGCGCGTCTATCCAAAGCCGGGCGTGGACCA 
-5 KLPRDTLQRAARLYTEARVQ 

AGCGGAAGGGCCCTTGGGTGATGCTGGGGAGCTGCCGGGTGTGCCCGTCTGCGAAGGGGA 

2401 + + + + + + 2460 

TCGCCTTCCCGGGAACCCACTACGACCCCTCGACGGCCCACACGGGCAGACGCTTCCCCT 
_5 YRFPGQTI SPLQRTHGDAFP 

TGACAGCGCCGTCGGGCGAGAGGGTGTCGAGGCCGGTCACGGGGTAGGTGGCGAAGCTCG 

2461 + + + + + + 2520 

ACTGTCGCGGCAGCCCGCTCTCCCACAGCTCCGGCCAGTGCCCCATCCACCGCTTCGAGC 
-5 IVAGDPSLTDLGTVPYTAFS 

GCTTGGACTGTTCACCGTCCACGAGGACGGGGCTGCCGACTCGTTCCAGTCGTGTCAGGG 

2521 + + "-- + + + + 2580 

CGAACCTGACAAGTGGCAGGTGCTCCTGCCCCGACGGCTGAGCAAGGTCAGCACAGTCCC 

-5 PKSQEGDVLVPSGVRELRTL 

TGTCCGCGACGGCCTGTTCCTGCTGTTTGGCCAGGTCCGTGGCGTCCAGGGTTCCCTGGG 

2581 + + + + + + 2640 

ACAGGCGCTGCCGGAGAAGGACGACAAACCGGTCCAGGCACCGCAGGTCCCAAGGGACCC 
-5 TDAVAQEQQKALDTADLTGQ 

CATGCGCGGCAAGGGCGTGCAGGAGTGTCGCGGAGCGCGGAAGGCTGCCGATCGGCTCAG 

2641 + + + + + + 2700 

GTACGCGCCGTTCCCGCACGTCCTCACAGCGCCTCGCGCCTTCCGACGGCTAGCCGAGTC 

-5 AHAALAHL LTAS R P L S G I P E 

TGGCGATGGTCATGGCCGAAGAGTAGGGAAGAGGCTGGGTTTCGAACCACCGCAAAGCTT 

2701 + + + + + + 2760 

ACCGCTACCAGTACCGGCTTCTCATCCCTTCTCCGACCCAAAGCTTGGTGGCGTTTCGAA 

-5-< T A I T M - 

TGATTGCCGCTTTTTCAGGGGAAGTTGATGCGAAGTCGCCGAGCGGCGGAACGTGCTGAT 

2761 + + + + + + 2820 

ACTAACGGCGAAAAAGTCCCCTTCAACTACGCTTCAGCGGCTCGCCGCCTTGCACGACTA 

GTATGGGGGGCGGGAGGAGCCTGCGGGGTTCTAGGAGCCGGTCGCGGCCACGGTGGAGGA 

1 — — — — i — — ^ — — + — — — h h 2880 

CATACCCCCCGCCCTCCTCGGACGCCCCAAGATCCTCGGCCAGCGCCGGTGCCACCTCCT 
_ 4 _* *SGTAAVTSS- 

GGTGCCCAGCTGGGAGCGGGGGGTCTTTTCGCCGACGCGGTTGGGCTCGATGGTGCGGGG 

2881 + + + + + + 2940 

CCACGGGTCGACCCTCGCCCCCCAGAAAAGCGGCTGCGCCAACCCGAGCTACCACGCCCC 
-4 TGLQSRPTKEGVRNPE ITRP- 

GTCGACGGCCTCTCCGGGGGCACCTTGCCGGTAGACGCCTTCGGGGTCGGAGTCCCGGTC 

2941 + + + + + + 3000 

CAGCTGCCGGAGAGGCCCCCGTGGAACGGCCATCTGCGGAAGCCCCAGCCTCAGGGCCAG 
-4 DVAEGPAGQRYVGE PDSDRD- 

ATGGGGGAGCAGGAAGAAGACCCGGCGCCGGTACAGACCGCTGTCCGGGTCCGCTTCGGC 

3001 + + + + + + 3060 

TACCCCCTCGTCCTTCTTCTGGGCCGCGGCCATGTCTGGCGACAGGCCCAGGCGAAGCCG 
-4 H PL LF FVRRRYLGS D PDAEA- 

GTCGGCCCCGAGTTCGATGTAGCCGATCATGCGGCCGTCGCGGGCGTAGCGCGGCTTGTT 

3061 + + + + + + 3120 

CAGCCGGGGCTCAAGCTACATCGGCTAGTACGCCGGCAGCGCCCGCATCGCGCCGAACAA 
-4 DAGLE I YG I MRGDRAYRPKN- 
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m 

CTTGCGCCGGGGGGTCTTGTCCAGGGCCTGGCGGACGTAGTCGAGTCCCTCGGGATCTTC 

3121 + + + + + + 3180 

GAACGCGGCCCCCCAGAACAGGTCCCGGACCGCCTGCATCAGCTCAGGGAGCCCTAGAAG 
-4 KRRPTKDLAQRVYDLGE PDE- 

GAGCCACACGACCTTCGCCTCGTGAACGAGATCGCTGTCGGTCAGTAGCGAGCTCATGGC 

3181 + + + + + + 3240 

CTCGGTGTGCTGGAAGCGGAGCACTTGCTCTAGCGACAGCCAGTCATCGCTCGAGTACCG 
-4-< LWVVKAEHVLDS DTLLS S M - 

GGCGACCTCTCCTTCGTCGGCGTGCACCGGGTGGGGAAGCGGTGCCTGCGTGATGTGTGT 

3241 + + + + + + 3300 

CCGCTGGAGAGGAAGCAGCCGCACGTGGCCCACCCCTTCGCCACGGACGCACTACACACA 

TCGTCTGCGGCGGTGGGCCGCAGTGGTGCGGACCGCCCGTGGTGCCGGTTCTCGGCCAAA 

3301 + + + + + + 3360 

AGCAGACGCCGCCACCCGGCGTCACCACGCCTGGCGGGCACCACGGCCAAGAGCCGGTTT 

GCACGGGCAGGTACGTCCTGGGGCACTCACATCGTAGATGGGGTCCGCTTCCGCAGGGCA 

3361 + + + + + + 3420 

CGTGCCCGTCCATGCAGGACCCCGTGAGTGTAGCATCTACCCCAGGCGAAGGCGTCCCGT 

GTGCCTCCGGTCGGAGGACGTTCATTCGTCGGCTGCCAGAGCGAGGTTGGGGTAGAACTT 

3421 + + + + + + 3480 

CACGGAGGCCAGCCTCCTGCAAGTAAGCAGCCGACGGTCTCGCTCCAACCCCATCTTGAA 
_3_* * EDAALALNPYFK- 

CCGGCCGTTGGATTTGATCATGTCGGCAGGTGAGGCGAGGCCCACTTCCTGGCGGACCCG 

3481 + + + + + + 3540 

GGCCGGCAACCTAAACTAGTACAGCCGTCCACTCCGCTCCGGGTGAAGGACCGCCTGGGC 
-3 RGNS KI MDAPSALGVEQRVR- 

GGTGGCGAAGGCACGGGCGGTCCCGGGGCGGATGCCTTCACTGTGTGCGCACCAGGTGCT 

3541 + + + + + + 3600 

CCACCGCTTCCGTGCCCGCCAGGGCCCCGCCTACGGAAGTGACACACGCGTGGTCCACGA 
-3 TAFARATGPRIGESHACWTS- 

GTAGGACGTGTAGAGAAGGCCCTGTTCGACGCGTAGCTCGCTGTTCTCGGGGTCGTGGAG 

3601 + + + + + + 3660 

CATCCTGCACATCTCTTCCGGGACAAGCTGCGCATCGAGCGACAAGAGCCCCAGCACCTC 
_3 YSTYLLGQEVRLESNEPDHL- 

GCAGCACTCGGCGAGGAAGCGGCCGATGTGGTCCTCGGTGTTCGCGTATGCGCTGGTGGC 

3661 + + + + + + 3720 

CGTCGTGAGCCGCTCCTTCGCCGGCTACACCAGGAGCCACAAGCGCATACGCGACCACCG 
-3 CCEALFRGIHDETNAYASTA- 

GATGCGGACCCGGTCGGGGCCGGCGAGTGTGTCGCGGGTGGCGAGGTAGCGGCGGGCCCC 

3721 + + + + + + 3780 

CTACGCCTGGGCCAGCCCCGGCCGCTCACACAGCGCCCACCGCTCCATCGCCGCCCGGGG 
-3 I RVRD PGALTDRTALYRRAG- 

TTCGGTGAGCCAGTGCAGGATCCCGGGGCCCTCGTCCTGGACGAGTTCGACAGCCAGGTT 

3781 + + + + + + 3840 

AAGCCACTCGGTCACGTCCTAGGGCCCCGGGAGCAGGACCTGCTCAAGCTGTCGGTCCAA 
-3 ETLWHL I GPGEDQVLEVALN- 

GTCGATCTTGCGTTCGTCGGGGACGATCCGTTCGAAGGGCAGGAGGCGGATGCGGCGCCA 

384 1 + + + + + + 3900 

CAGCTAGAACGCAAGCAGCCCCTGCTAGGCAAGCTTCCCGTCCTCCGCCTACGCCGCGGT 
-3 DIKREDPVIREFPLLRIRRW- 

GAAGGCGAAGCCGCCGGTGGAGACCTCGGGGCGGTGGTTGCCCAGCAGCCACAGCTTGTG 

3901 + + + + + + 3960 

CTTCCGCTTCGGCGGCCACCTCTGGAGCCCCGCCACCAACGGGTCGTCGGTGTCGAACAC 
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-3 FA FGGTSVE PRHNGLLWLKH- 

CGTGGGTGTGAAGGAGAAATAGTCCTGCCGCATGCGGCGGGCCTTGATCTTGTCACCGCC 

3961 + + + + + + 4020 

GCACCCACACTTCCTCTTTATCAGGACGGCGTACGCCGCCCGGAACTAGAACAGTGGCGG 
-3 TPTFSFYDQRMRRAKI KDGG- 

GGTCAGCAGGCGGACGCGCGCCTCGTCGAAGCGGTCGTTGGGCTTGAGCTCGCTGCACAC 

4021 + + + + + + 4080 

CCAGTCGTCCGCCTGCGCGCGGAGCAGCTTCGCCAGCAACCCGAACTCGAGCGACGTGTG 
-3 TLLRVRAEDFRDNPKLESCV- 

GATGAGGCGGCGGCCGTGGAGTTCGGTGAGCTCGGTGGAGTGTTCGGAGTATGCGCCACG 

4081 + + + + + + 4140 

CTACTCCGCCGCCGGCACCTCAAGCCACTCGAGCCACCTCACAAGCCTCATACGCGGTGC 

-3 ILRRGHLETLETSHESYAGR- 

GTCCATGAGGAAACCCGGCGGGGCTGCGTCGGCGTAGTCGCCGAGAATCTGGATCATCAC 

4141 + + + + + + 4200 

CAGGTACTCCTTTGGGCCGCCCCGACGCAGCCGCATCAGCGGCTCTTAGACCTAGTAGTG 
-3 DMLFGP PAADAYDGL I Q I M V - 

GTCGAGGAGAACGGATTTGCCGTTCTTTCCCTGGCCGTGGAGAAAGGGCAGCACCTGCGC 

4201 + + + + + + 4260 

CAGCTCCTCTTGCCTAAACGGCAAGAAAGGGACCGGCACCTCTTTCCCGTCGTGGACGCG 
-3 DLLVS KGNKGQGHL F P LVQA- 

CCCGACGTCACCGGTGATGGAGTAGCCGAGAAGGAGGTGGAGGAAGTCGATCATCTCCCG 

4261 + + + + + + 4320 

GGGCTGCAGTGGCCACTACCTCATCGGCTCTTCCTCCACCTCCTTCAGCTAGTAGAGGGC 
-3 GVDGTI SYGLLLHLFDIMER- 

CCCTTCGGCGTCACTGCCGAAGGTGTCTTCGAGGAAACGGTGCCAGCGGGGGGTGGGGAT 

4321 + + + + + + 4380 

GGGAAGCCGCAGTGACGGCTTCCACAGAAGCTCCTTTGCCACGGTCGCCCCCCACCCCTA 
-3 GEADSGFTDELFRHWRPTPI- 

GTCCTGGGGGGAGGCGCTGGTGGCGCGGGAGTGGAAGTCCCGGGTGGGGTCGGGCTTGCG 

4381 + + + + + + 4440 

CAGGACCCCCCTCCGCGACCACCGCGCCCTCACCTTCAGGGCCCACCCCAGCCCGAACGC 
-3 DQPSASTARSHFDRTPD PKR- 

CATACGGCCGTTGCGGAGGTCGACCACTCCGTCAGGGGTGCACAGGGCGTAGGGGTCTCC 

4441 + + + + + + 4500 

GTATGCCGGCAACGCCTCCAGCTGGTGAGGCAGTCCCCACGTGTCCCGCATCCCCAGAGG 
-3 MRGNRLDVVGD PTCLAY PDG- 

GTCGAGGGTGTCGGGATCGAGGGAGAGGTCGGGAGAGGCCTTTGCCTGGGTGAGGAGCGC 

4501 + + + + + + 4560 

CAGCTCCCACAGCCCTAGCTCCCTCTCCAGCCCTCTCCGGAAACGGACCCACTCCTCGCG 
-3 DLTD PDLS LD PSAKAQTLLA- 

CTTCATACCGGTCGTCGACAGGGTGCGGCGTTTGTGGTGGTGCAGTTCCCGGTCGGTGAA 

4561 + + + + + + 4620 

GAAGTATGGCCAGCAGCTGTCCCACGCCGCAAACACCACCACGTCAAGGGCCAGCCACTT 
-3 KMGTTS LTRRKHHHLERDTF- 

CAGCCCGCGGGGATCGCTGCCGGGCATCTCCTCCGCCATCTCTCCGGCAGCCCACAGGGC 

4621 + + + + + + 4680 

GTCGGGCGCCCCTAGCGACGGCCCGTAGAGGAGGCGGTAGAGAGGCCGTCGGGTGTCCCG 
-3 LGRPDSGPMEEAMEGAAWLA- 

AGCTTTCTCGCCTCCGGCCCGCTTCCACCGGTAGCCGTCCCAGGAGTACCAGCCCAGGCC 

4681 + + + + + + 4740 

TCGAAAGAGCGGAGGCCGGGCGAAGGTGGCCATCGGCAGGGTCCTCATGGTCGGGTCCGG 
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3 AKEGGARKWRYGDWS YWGLG- 

CTCCACGTGCCGGAACTGGTCACGGTAGAGACGGACGAAGAGCTTGGCGTTGCCGCGGTC 

474! + + + + + + 4800 

GAGGTGCACGGCCTTGACCAGTGCCATCTCTGCCTGCTTCTCGAACCGCAACGGCGCCAG 
3 EVHRFQDRYLRVFLKANGRD- 

GGTCAGGCTGGCGGGAATCTCGCCCGCCTCCCAGGCGGTCGCGGCGACGGGGGCCTCGGG 

48 0i + + + + + + 4860 

CCAGTCCGACCGCCCTTAGAGCGGGCGGAGGGTCCGCCAGCGCCGCTGCCCCCGGAGCCC 
3 TLSAP I EGAEWATAAVPAE P - 

AGCGGCCTGGACAGGGAGGAGCGGCGCTGGGGCCGGGGTGGTTTCGAGGGCCAGCATCTG 

4861 + + + + + + 4920 

TCGCCGGACCTGTCCCTCCTCGCCGCGACCCCGGCCCCACCAAAGCTCCCGGTCGTAGAC 
3 AAQVPLL PAPAPTTELALMQ- 

CTGAGCGGCGGCAGTTGCGTCAAAGCGAGGGCCCTCGGCGCTGCTGCTCATGGACGTCCT 

4921 + + + + + + 4980 

GACTCGCCGCCGTCAACGCAGTTTCGCTCCCGGGAGCCGCGACGACGAGTACCTGCAGGA 
3-< QAAATADFRPGEAS S S M - 

TCGAGATGGAGCGGTCGGGCGGTCCCCGCTGCGGGAACGGCATGAATGATCTTCCCGGTG 

4981 + + + + + + 5040 

AGCTCTACCTCGCCAGCCCGCCAGGGGCGACGCCCTTGCCGTACTTACTAGAAGGGCCAC 

CGGACAGAGTGCCAGGGGCAGCGCATGTGCGGGGGGACAACGGCCCGTTTCGGACGAGGG 

5041 + + + + + + 5100 

GCCTGTCTCACGGTCCCCGTCGCGTACACGCCCCCCTGTTGCCGGGCAAAGCCTGCTCCC 

CCGGCCGACGGGGGGAAGCAGGGGCCGGCAACCGGGTGGCGGGGCGGCGTGAGCGAGGGC 

5101 + + + + + + 5160 

GGCCGGCTGCCCCCCTTCGTCCCCGGCCGTTGGCCCACCGCCCCGCCGCACTCGCTCCCG 

ACGAGCGGCCCGGTACGGGGGGAAGGGCTCGTCTCTCCGTGGGGCGGCACGTTGTGGTCC 

5161 + + + + + + 5220 

TGCTCGCCGGGCCATGCCCCCCTTCCCGAGCAGAGAGGCACCCCGCCGTGCAACACCAGG 

TCGTCCGTCAGCTTGCGTCTGGCTTCAGCCTCCTGACCCCCAATAAGGCGAAAGCTGCTG 

5221 ~ — j_ _ _ _ — _ + — — — — — — — — — (- — — — — — + 5280 

AGCAGGCAGTCGAACGCAGACCGAAGTCGGAGGACTGGGGGTTATTCCGCTTTCGACGAC 

GTCAAGCATCTTTCGTGACACTCGGCGAGGGACTGAAGGGACTGTCTTTCGGAATGAGTG 

5281 + + + + + + 5340 

CAGTTCGTAGAAAGCACTGTGAGCCGCTCCCTGACTTCCCTGACAGAAAGCCTTACTCAC 

TAGGGGGTTGTCGGGTGGGGACCGCGCCTCGACTCCCCGGCGGACGGGATCTGTTCGGTC 

5341 + + + + + + 5400 

ATCCCCCAACAGCCCACCCCTGGCGCGGAGCTGAGGGGCCGCCTGCCCTAGACAAGCCAG 

GGTCCCTTGGGTCCCTCCCCGGATCGCGGCAGGGACCCAAGGGGGCGGTGCGGCGGGCGG 

5401 - + + + + + + 5460 

CCAGGGAACCCAGGGAGGGGCCTAGCGCCGTCCCTGGGTTCCCCCGCCACGCCGCCCGCC 

TCGGTGAGGGGCCCCGGTGGAGGGACTGAGGGTCTGTATGGAGCGATAAGAGGGTCTGAA 

5461 + + + + + + 5520 

AGCCACTCCCCGGGGCCACCTCCCTGACTCCCAGACATACCTCGCTATTCTCCCAGACTT 

GGGGCGGAGAGAGTTTCGGTCCCTGCGTTGAGTCCCTGGTCATCACCGCAGGTCAGAGGG 

5521 + + + + + + 5580 

CCCCGCCTCTCTCAAAGCCAGGGACGCAACTCAGGGACCAGTAGTGGCGTCCAGTCTCCC 



GTTTTGAGGGGTGAAAAAGGGACTGAAGGGACTCAACTTCCCCATTATGAGCTGAGTAGA 

+ + + + + + 

CAAAACTCCCCACTTTTTCCCTGACTTCCCTGAGTTGAAGGGGTAATACTCGACTCATCT 
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AGAAAGCAGTATGACGATATCGGCGCCTACATACGCGCGCGTACATAGTGAGCTTATAAT 

5641 + + + + + + 5700 

TCTTTCGTCATACTGCTATAGCCGCGGATGTATGCGCGCGCATGTATCACTCGAATATTA 

GCGGAAGTTGAGTCCCTTCAGTCCCTTTTCGTGGGGTCGTATCCCCTCTGACTGCGTTGA 

5701 + + + + + + 5760 

CGCCTTCT^ACTCAGGGAAGTCAGGGAAAAGCACCCCAGCATAGGGGAGACTGACGCAACT 

CCGTCGCCGCTCCGCGCAGGGACCGAAGAGGGACCAAGTCCCTGCGCGGGGCGGGCGACG 

5761 + + + + + + 5820 

GGCAGCGGCGAGGCGCGTCCCTGGCTTCTCCCTGGTTCAGGGACGCGCCCCGCCCGCTGC 

GTAATCGTGCAGTGCCCCCTCCCCCGTTTCCCACAGCGAGTCGTCGCTCCCCTGTGAGGC 

5821 + + + + + + 5880 

CATTAGCACGTCACGGGGGAGGGGGCAAAGGGTGTCGCTCAGCAGCGAGGGGACACTCCG 

CGGAGAGGGTCCTAGAACCCCTCAGGGGCCGTTCTGTGGCCCTCTGGGCCTCCTCCTGGC 

5881 + + + + + + 5940 

GCCTCTCCCAGGATCTTGGGGAGTCCCCGGCAAGACACCGGGAGACCCGGAGGAGGACCG 

CATTTACCCCATGGGGGCGCTTGGGGGCGTCAGGAGGGCTTGTGAGGGCTCTGCCGGGAA 

5941 + + + + + + 6000 

GTAAATGGGGTACCCCCGCGAACCCCCGCAGTCCTCCCGAACACTCCCGAGACGGCCCTT 
2-> MRALPGS- 

GTGGCGGATTGCGCATGGCAGGAGATGCCCCGACAGCGGCCGGGAATCGACGATGTCCCC 

6001 + + + + + + 6060 

CACCGCCTAACGCGTACCGTCCTCTACGGGGCTGTCGCCGGCCCTTAGCTGCTACAGGGG 
2 GGLRMAGDAPTAAGNRRC P P - 

CGACCCCTATCCAGCGTCCGCTGATCCTCAGGAGGCAGACCTTGCAGGCTCCAGAAGCGA 

6061 + + + + + + 6120 

GCTGGGGATAGGTCGCAGGCGACTAGGAGTCCTCCGTCTGGAACGTCCGAGGTCTTCGCT 
2 TPIQRPLILRRQTLQ APEAK- 

AGAACGGCCGGTCCCCGGAGCAGCCGCAGGAAGAGCGGATCGTCCTGGACGTATGGCTGG 

6121 + + + + + + 6180 

TCTTGCCGGCCAGGGGCCTCGTCGGCGTCCTTCTCGCCTAGCAGGACCTGCATACCGACC 
2 NGRS PEQ PQEER IVLDVWLA- 

CGAACTACCCGTTCCCCACCTATGACGGGCGTGACTTCCTCGCTCCGCTGCGCGAGCGGG 

6181 + + + + + + 6240 

GCTTGATGGGCAAGGGGTGGATACTGCCCGCACTGAAGGAGCGAGGCGACGCGCTCGCCC 
2 NYPFPTYDGRDFLAPLRERA- 

CGGCGGAGTTCGAGCGCGCCCACCCCCGATACCGGGTCGACATCAACGGCCACGACTTCT 

6241 + + + + + + 6300 

GCCGCCTCAAGCTCGCGCGGGTGGGGGCTATGGCCCAGCTGTAGTTGCCGGTGCTGAAGA 
2 AEFERAHPRYRVDINGHDFW- 

GGACCATCCCCGAGAAGGTGGCGCGCGCCACCGCGGAGGGCAGGCCTCCGCACATAGCGG 

6301 + + + + + + 6360 

CCTGGTAGGGGCTCTTCCACCGCGCGCGGTGGCGCCTCCCGTCCGGAGGCGTGTATCGCC 
2 TIPEKVARATAEGRPPHIAG- 

GCTACTACGCCACCGACAGCCAGTTGGCGCGGGACGCGCGCAGGCCCGACGGGAAGCCGG 

6361 + + + + + + 6420 

CGATGATGCGGTGGCTGTCGGTCAACCGCGCCCTGCGCGCGTCCGGGCTGCCCTTCGGCC 
2 YYATDS QLARDARRPDGKPV- 

TCTTCACCTCGGTGGAGGCCGCGTTGGCCGGCCGGACGGAGATACTGGGACACCCGGTGG 

6421 + + + + + + 6480 

AGAAGTGGAGCCACCTCCGGCGCAACCGGCCGGCCTGCCTCTATGACCCTGTGGGCCACC 
2 FTSVEAALAGRTE I LGHPVV- 

TGGTGGAGGACCTCGACCCCGTGGTGCGCGACTCCTACTCGTTCGGGGGCGAGTTGGTGT 
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6481 + + + + + + 6540 

ACCACCTCCTGGAGCTGGGGCACCACGCGCTGAGGATGAGCAAGCCCCCGCTCT^ACCACA 
2 VEDLDPVVRDSYS FGGELVS- 

CGCTGCCGCTCACGGTCACCACCATGCTCTGCTACGCCAACTCCTCCCTCCTCGCGCGCG 

6541 + + + + + + 6600 

GCGACGGCGAGTGCCAGTGGTGGTACGAGACGATGCGGTTGAGGAGGGAGGAGCGCGCGC 
2 h PLTVTTMLCYANS SLLARA- 

CCGGTGTTCCGGAGTTGCCCCGTACCTGGGATGAGGTCGAAGCAGCCTGCCAGGCGGTGG 

6601 + + + + + + 6660 

GGCCACAAGGCCTCAACGGGGCATGGACCCTACTCCAGCTTCGTCGGACGGTCCGCCACC 
2 GVPELPRTWDEVEAACQAVA- 

CCAGCGTCGACGGGGGGCCCGGTCACGGAATCACCTGGGCCAACGACGGCTGGGTTTTCC 

6661 + + + + + + 6720 

GGTCGCAGCTGCCCCCCGGGCCAGTGCCTTAGTGGACCCGGTTGCTGCCGACCCAAAAGG 
2 SVDGGPGHGITWANDGWVFQ- 

AGCAGGCCGTCGCCCTTCAGAACGGGGTGCTGACCGATCAGGACAACGGCCGCTCCGGCT 

6721 + + -- + + + + 6780 

TCGTCCGGCAGCGGGAAGTCTTGCCCCACGACTGGCTAGTCCTGTTGCCGGCGAGGCCGA 
2 QAVALQNGVLTDQDNGRSGS- 

CCGCCACGACGGTGGACGTCACATCGGACGAGATGCTGGACTGGGTCCGCTGGTGGACGC 

6781 + + + + + + 6840 

GGCGGTGCTGCCACCTGCAGTGTAGCCTGCTCTACGACCTGACCCAGGCGACCACCTGCG 
2 ATTVDVTS DEMLDWVRWWTH- 

ACCTCCATGAGCGCGGCCATTACCTCTACACGGGCGGGCCCTCGGACTGGGGCGGGGCGT 

6841 + + + + + + 6900 

TGGAGGTACTCGCGCCGGTAATGGAGATGTGCCCGCCCGGGAGCCTGACCCCGCCCCGCA 
2 LHERGHYLYTGGPSDWGGAF- 

TCGAGGCTTTCGTCCAGCAGAAGGTCGCATTCACCTTCGACTCGTCC7VAGGCCGCCCGGG 

6901 + + + + + + 6960 

AGCTCCGAAAGCAGGTCGTCTTCCAGCGTAAGTGGAAGCTGAGCAGGTTCCGGCGGGCCC 
2 EAFVQQKVAFTFDS SKAARE- 

AACTCATCCAGGCCGGTGCACAGGCCGGTTTCGAGGTCGCGGTGTTCCCGTTGCCCAGGA 

6961 + + + + + + 7020 

TTGAGTAGGTCCGGCC^CGTGTCCGGCCAAAGCTCCAGCGCCACAAGGGCAACGGGTCCT 
2 L I QAGAQAG F EVAVF P L P RN- 

ACGCGAAGGCCCCGGTAGCGGGCCAGCCCGTCTCGGGAGACTCCCTGTGGCTGGCCGCGG 

7021 + + + + + + 7080 

TGCGCTTCCGGGGCCATCGCCCGGTCGGGCAGAGCCCTCTGAGGGACACCGACCGGCGCC 
2 AKAPVAGQPVS GD S LWLAAG- 

GACTCGACGAGACCACGCAGGACGGGCTGCTCGCTCTCACCCAGTACCTGATCAGCCCGG 

7081 + + + + + + 7140 

CTGAGCTGCTCTGGTGCGTCCTGCCCGACGAGCGAGAGTGGGTCATGGACTAGTCGGGCC 
2 LDETTQDGLLALTQYLISPA- 

CCAACGCCGCGGACTGGCACCGCACCAACGGTTTCGTACCGGTGACCGGCGCGGCCGGGG 

7141 + + + + + + 7200 

GGTTGCGGCGCCTGACCGTGGCGTGGTTGCCAAAGCATGGCCACTGGCCGCGCCGGCCCC 
2 NAADWHRTNGFV PVTGAAGE- 

AACTGCTGGAAGCGACAGGCTGGTTCGACCGCCGGCCGCAGCAACGGGTGGCCGGGGAGC 

7201 + + + + + + 7260 

TTGACGACCTTCGCTGTCCGACCAAGCTGGCGGCCGGCGTCGTTGCCCACCGGCCCCTCG 
2 LLEATGWFDRRPQQRVAGEQ- 

AGTTGAAGGCGTCCGACCGGTCACCGGCGGCGCTCGGCGCGCTGCTCGGCGACTTCGCGG 
7261 + + + + + + 7320 
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TCAACTTCCGCAGGCTGGCCAGTGGCCGCCGCGAGCCGCGCGACGAGCCGCTGAAGCGCC 
2 LKAS DRS PAALGALLGDFAA- 

CCGTCAACGAGGTCATCACCGCAGCGATGGACGATGTCCTGCGCAGTGGAGCGGACCCCG 

7321 + + + + + + 7380 

GGCAGTTGCTCCAGTAGTGGCGTCGCTACCTGCTACAGGACGCGTCACCTCGCCTGGGGC 
2 VNEVI TAAMDDVLRSGADPA- 

CGAAGGCCTTCGCCGAAGCCGGCGTGGCCGCCCAGCAACTGCTCGATGCCTACAACGCCC 

7381 + + + + + + 7440 

GCTTCCGGAAGCGGCTTCGGCCGCACCGGCGGGTCGTTGACGAGCTACGGATGTTGCGGG 
2 KAFAEAGVAAQQLLDAYNAR- 

GGAACCGCTCCGGATCCGGGACCCCCTCCGCCGTCTGAGATCCGGTACCGGGGCACAGGG 

7441 + + + + + + 7500 

CCTTGGCGAGGCCTAGGCCCTGGGGGAGGCGGCAGACTCTAGGCCATGGCCCCGTGTCCC 
2-* NRSGSGTPSAV* - 

GCGCCGCCGCCCGCTTTCCCGGCGGGGCACTGGCCGGGGGACATGCTCTCCCGCCCCCGG 

7501 + + + + + + 7560 

CGCGGCGGCGGGCGAAAGGGCCGCCCCGTGACCGGCCCCCTGTACGAGAGGGCGGGGGCC 

CAGGACGTAGGGTCAACCCGCCTGCGCCTTCAGGTGGCGGCGCAGATACTCACCGGTCAG 

7561 + + + + + + 7620 

GTCCTGCATCCCAGTTGGGCGGACGCGGAAGTCCACCGCCGCGTCTATGAGTGGCCAGTC 
1-* * GAQAKLHRRLYEGTL- 

GGAGGAATCCGCGGCGAGCAGGTCCTTCGGTGTGCCGGTGAAGACGATCTCGCCGCCCTC 

7621 + + + + + + 7680 

CCTCCTTAGGCGCCGCTCGTCCAGGAAGCCACACGGCCACTTCTGCTAGAGCGGCGGGAG 
1 S SDAALLDKPTGTFVI EGGE- 

CCGTCCCCCGTCGGGACCCAGGTCGATGATCCAGTCGGCCTGCTGCACCACATCGAGGTT 

7681 + + + + + + 7740 

GGCAGGGGGCAGCCCTGGGTCCAGCTACTAGGTCAGCCGGACGACGTGGTGTAGCTCCAA 
1 RGGDPGLDI IWDAQQVVDLN- 

GTGCTCGATGACCACGACGGTGTTCCCGGCCTCGACGAGCCCGTCCAGGAGCTTCAGCAG 

7741 + + + + + + 7800 

CACGAGCTACTGGTGCTGCCACAAGGGCCGGAGCTGCTCGGGCAGGTCCTCGAAGTCGTC 
1 HE IVVVTNGAEVLGDLLKLL- 

GGTGTCAACGTCCGACATGTGCAGCCCGGTGGTGGGCTCGTCCAGGACATAGACCGTGCC 

7801 + + + + + + 7860 

CCACAGTTGCAGGCTGTACACGTCGGGCCACCACCCGAGCAGGTCCTGTATCTGGCACGG 
1 TDVDSMHLGTTPEDLVYVTG- 

CGTGCGGTGCAGCTGGTCGGCAAGTTTGATCCGCTGCAGTTCACCGCCGGAGAGGCTGGA 

7861 + + + + + + 7920 

GCACGCCACGTCGACCAGCCGTTCAAACTAGGCGACGTCAAGTGGCGGCCTCTCCGACCT 
1 TRHLQDAIjKI RQLEGGS LS S- 

AAGCGGCTGGCCCAGGCTGAGGTACCCAAGACCGACGTCGACGAGAGCGCGCAGTTTCGG 

7921 + + + + + + 7980 

TTCGCCGACCGGGTCCGACTCCATGGGTTCTGGCTGCAGCTGCTCTCGCGCGTCAAAGCC 
1 LPQGLSLYGLGVDVLARLKP- 

CAGCAGGGCCTTCTCGGTGAAGAACTCGACGGCCTCGTCGGCGGGCAGCTCCAGGACGTC 

7981 + + + + + + 8040 

GTCGTCCCGGAAGAGCCACTTCTTGAGCTGCCGGAGCAGCCGCCCGTCGAGGTCCTGC^AG 
1 LLAKETFFEVAEDAPLELVD- 

CGCGATCGACTTCCCGCGAAGCTGGTGCTCCAGGACCTCGGGCTTGAAGCGGCGCCCCTC 

8041 + + + + + + 8100 

GCGCTAGCTGAAGGGCGCTTCGACCACGAGGTCCTGGAGCCCGAACTTCGCCGCGGGGAG 
1 AI S KGRLQHELVE PKFRRGE- 



10 



ACAGACACCGCAGTGCGTGGTCACCGGATCCATGAAGGCCAGCTCGGTGATGATGACCCC 

8101 + + + + + + 8160 

TGTCTGTGGCGTCACGCACCAGTGGCCTAGGTACTTCCGGTCGAGCCACTACTACTGGGG 
1 CVGCHTTVPDMFALETIIVG- 

GCGGCCCTGGCACTCCTCGCACGACCCCTTGGAGTTGAAGCTGAACAGCGAGGCGTTCGC 

8161 + + + + + + 8220 

CGCCGGGACCGTGAGGAGCGTGCTGGGGAACCTCAACTTCGACTTGTCGCTCCGCAAGCG 
1 RGQCEECSGKSNFS FLSANA- 

GCCGGTCTCCTTCGCGAACAGCTTGCGCAGCGGGTCCATCAGGCCGAGGTAGGAGACCGG 

8221 + + + + + + 8280 

CGGCCAGAGGAAGCGCTTGTCGAACGCGTCGCCCAGGTAGTCCGGCTCCATCCTCTGGCC 
1 GTEKAFLK RLPDMLGLYSVP- 

TGTGGAGCGCGACGAGGCGGCGATCGCGGACTGGTCGACAAAGACCGCGTCGGGGTGCGC 

8281 + + + + + + 8340 

ACACCTCGCGCTGCTCCGCCGCTAGCGCCTGACCAGCTGTTTCTGGCGCAGCCCCACGCG 
1 TSRS SAAIASQDVFVADPHA- 

CTCCATGAATGCCCCGGAGATCAGGCTGCTCTTGCCGGAACCCGCCAGCCCGGTCACCGC 

8341 + + + + + + 8400 

GAGGTACTTACGGGGCCTCTAGTCCGACGAGAACGGCCTTGGGCGGTGGGGCCAGTGGCG 
1 EMFAGS ILSSKGSGAVGTVA- 

GGTCAGCACACCGGTGGGCACGGCCACGGAGACCTGCTTCAGGTTGTGGAGATCCGCGTT 

8401 + + + + + + 8460 

CCAGTCGTGTGGCCACCCGTGCCGGTGCCTCTGGACGAAGTCCAACACCTCTAGGCGCAA 
1 TLVGTPVAVSVQKLNHLDAN- 

CTCCACGGTCAGCTCCCCCGTGGGCGGGCGGACCTCCTCCTTCACGCGGGCCCCCCGCCG 

8461 + + + + + + 8520 

GAGGTGCCAGTCGAGGGGGCACCCGCCCGCCTGGAGGAGGAAGTGCGCCCGGGGGGCGGC 
1 EVTLEGT P PRVEEKVRAGRR- 

CAGAGCCTCCCCGGTCCGGGTCTTCGCCTTCCGCAGCTTCGCGAAGGACCCCTCGAACAC 

8521 + + + + + + 8580 

GTCTCGGAGGGGCCAGGCCCAGAAGCGGAAGGCGTCGAAGCGCTTCCTGGGGAGCTTGTG 
1 LAEGTRTKAKRLKAFSGEFV- 

GATCTCGCCCCCGTGCACTCCCGCCCCGGGACCGACATCGACGATGTGGTCGGCGATCTC 

8581 + + + + -- + + 8640 

CTAGAGCGGGGGCACGTGAGGGCGGGGCCCTGGCTGTAGCTGCTACACCAGCCGCTAGAG 
1 I EGGHVGAGPGVDV I HDAI E- 

GATCACAtcGGGGTCGTGCTCGACGACCAGCACGGTGTTCCCCTTGTCGCGCAGCGCGCG 

8641 + + + + + + 8700 

CTAGTGTagCCCCAGCACGAGCTGCTGGTCGTGCCACAAGGGGAACAGCGCGTCGCGCGC 
1 IVD PDHEVVLVTNGKDRLAR- 

CAGCAGGTCGTTGAGCCGCCCCACGTCGCGCGGGTGCAGGCCGATGCTGGGCTCGTCGAA 

8701 + + + + + + 8760 

GTCGTCCAGCAACTCGGCGGGGTGCAGCGCGCCCACGTCCGGCTACGACCCGAGCAGCTT 
1 LLDNLRGVDRPHLG I S PEDF- 

GATGTACGTGAGCCCGGCCAGACCACTGCCGAGGTGGCGCACCATCTTCAGCCGCTGCCC 

8761 + + + + + + 8820 

CTACATGCACTCGGGCCGGTCTGGTGACGGCTCCACCGCGTGGTAGAAGTCGGCGACGGG 
1 I YTLGALGS GLHRVMKLRQG- 

CTCGCCCCCCGAGAGGTCGGCCGTGGGCCTGTCCAGGGTCAGGTAGCCGAGCCCGATGGA 

8821 + + + + + + 8880 

GAGCGGGGGGCTCTCCAGCCGGCACCCGGACAGGTCCCAGTCCATCGGCTCGGGCTACCT 
1 EGGSLDATPRDLTLYGLGIS- 



11 




CACGATCCGCTCCAGGGCCGTGCGCGCGGCTTTCGCGAGAGGGGCAGCGGCCGGCTCCGT 

8881 + + + + + + 8940 

GTGCTAGGCGAGGTCCCGGCACGCGCGCCGAAAGCGCTCTCCCCGTCGCCGGCCGAGGCA 
VI RELATRAAKAL PAAAPET- 

GACGCCGGCGAGCACCTCCGTGAGGTCGCGGACCTCCATGCTCGAGTAGTCGGCGATGTT 

8941 + + + + + + 9000 

CTGCGGCCGCTCGTGGAGGCACTCCAGCGCCTGGAGGTACGAGCTCATCAGCCGCTACAA 
VGALVETLDRVEMSSYDAIN- 

CTTGCCGTCGATCCGGACGTCGAGCGCGGCGGCGTTGAGCCGCGCGCCCCGGCAGGAGGG 

900 i + + + + + + 9060 

GAACGGCAGCTAGGCCTGCAGCTCGCGCCGCCGCAACTCGGCGCGCGGGGCCGTCCTCCC 
KGD IRVDLAAANLRAGRCS P - 

ACAGACTCCGTCgGTGACGAAACGTTCGATGACCTCGCGCTTGCGGTCGcTCAGCGCGCT 

9061 + + + + + + 9120 

TGTCTGAGGCAGcCACTGCTTTGCAAGCTACTGGAGCGCGAACGCCAGCgAGTCGCGCGA 
CVGDTVFRE IVERKRDSLAS- 

GAGGTCGCGCTTGAGGTTGAgCCGCTCGAACCGGTCGGCcAACCCCTCGTAGTTCGTCTG 

912 1 + + + + + + 9180 

CTCCAGCGCGAACTCCAACTcGGCGAGCTTGGCCAGCCGgTTGGGGAGCATCAAGCAGAC 
LDRKLNLRE FRDALGEYNTQ- 

GAACTCGGTGCTCTTGGTCTTCAGCGTcACCTTCCCGCCGGTGcCGCGCAGCAGCGTGTC 

9181 + + + + + + 9240 

CTTGAGCCACGAGAACCAGAAGTCGCAgTGGAAGGGCGGCCACgGCGCGTCGTCGCACAG 
FETSKTKLTVKGGTGRLLTD- 

CAGCTCCTCGGCGCTGTACTCGGCGATCGGCTTGGCCGGATCCAGACGGCCGGACTTCGC 

9241 + + + + + + 9300 

GTCGAGGAGCCGCGACATGAGCCGCTAGCCGAACCGGCCTAGGTCTGCCGGCCTGAAGCG 
LEEASYEAI PKAPDLRGSKA- 

CCAGATCTGCCAGTCCGGGCTACCCACCTTGTACTCGGGGAAAAGGACCGCCCCGTCGTC 

9301 + + + + + + 9360 

GGTCTAGACGGTCAGGCCCGATGGGTGGAACATGAGCCCCTTTTCCTGGCGGGGCAGCAG 
W I QWDPS GVKYE P FLVAGDD- 

CAGGGACTTCGAGCGGTCCAGCATCTTGTCCAGGTCGAGGGCGATGCTCTGGCCGAGACC 

9361 + + + + + + 9420 

GTCCCTGAAGCTCGCCAGGTCGTAGAACAGGTCCAGCTCCCGCTACGAGACCGGCTCTGG 
LSKSRDLMKDLDLAI SQGLG- 

GTCGCAGTCCGGGCACATGCCCTGGGGGTCGTTGAACGAGAACGCGGAGACGCCGAGCGA 

9421 + + + + + + 9480 

CAGCGTCAGGCCCGTGTACGGGACCCCCAGCAACTTGCTCTTGCGCCTCTGCGGCTCGCT 
DCDPCMGQPDNFS FASVGLS- 

GGACGGCCCGTCGTCCTTCGTCGTGCCGAACCGTGCGAACAGGGCCCGGATCATCGGCTG 

9481 + + + + + + 9540 

CCTGCCGGGCAGCAGGAAGCAGCACGGCTTGGCACGCTTGTCCCGGGCCTAGTAGCCGAC 
S PGDDKTTGFRAFLARI MPQ- 

TACGTCCGTCATGGTCCCCACCGTGGACCGGGCGTTGCCCCCCACGGGCTTCTGGTCGAC 

9541 + + + + + + 9600 

ATGCAGGCAGTACCAGGGGTGGCACCTGGCCCGCAACGGGGGGTGCCCGAAGACCAGCTG 
VDTMTGVT S RANGGVP KQDV- 

GATCACCGGGGTGGTGAGGTTCTCGATCGCCTCGGCCTGAGGACGTTCGTACTTCGGAAG 

960 i + + + + + + 9660 

CTAGTGGCCCCACCACTCCAAGAGCTAGCGGAGCCGGACTCCTGCAAGCATGAAGCCTTC 
IVPTTLNEIAEAQPREYKPL- 

CTGGTTGCGGATGTACCAGCTGAAGGTGGAGTTCAGCTGTCGCTGGGCCTCCACGGCCAC 
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9661 + + + + + 

GACCAACGCCTACATGGTCGACTTCCACCTCAAGTCGACAGCGACCCGGAGGTGCCGGTG 
QNR I YWS FTSNLQRQAEVAV- 

CGTGTCGAAGACGATCGACGACTTGCCCGAACCCGAGACCCCCGTGAAGACCGTGATCTG 

9721 + + -- + + + + 9780 

GCACAGCTTCTGCTAGCTGCTGAACGGGCTTGGGCTCTGGGGGCACTTCTGGCACTAGAC 
TDFVI SSKGSGSVGTFVTIQ- 

GTTGCGGGGAATCGTCAGGGAGACATCTTTGAGGTTGTGGATCCGCGCGCCCGCGATGCG 

9781 + + + + + + 9840 

CAACGCCCCTTAGCAGTCCCTCTGTAGAAACTCCAACACCTAGGCGCGCGGGCGCTACGC 
NRP I TLSVDKLNH I RAGAI R - 

GATGCCGTCTCCCGGGCCGGATGTTTTTCCCGCGCCGGCGGTGGGGTCGGTGACGCTCAC 

9841 + + + + + + 9900 

CTACGGCAGAGGGCCCGGCCTACAAAAAGGGCGCGGCCGCCACCCCAGCCACTGCGAGTG 
< IGDGPGSTKGAGATPDTVSM- 

AGAGTTTTCCTCCTGGCTTCCGTACATGATTTACCGTGTCAGCCGGGCAAACCGGCGGAA 

9901 + + + + + + 9960 

TCTCAAAAGGAGGACCGAAGGCATGTACTAAATGGCACAGTCGGCCCGTTTGGCCGCCTT 

CGGTAACCACCTAGCTTGTACTCAGGAGGTGTCCGGGGTCTTCTCCTCCCGTGCTGACTT 

9961 + + + + + + 10020 

GCCATTGGTGGATCGAACATGAGTCCTCCACAGGCCCCAGAAGAGGAGGGCACGACTGAA 

* STD PTKE E RAS K - 

GGGGGCCGGCCCGCCGGACAGGGCCGGCTCCGTGTTCCACCCCGCCAGCCGATCCCCCCG 

10021 + + + + + " + 10080 

CCCCCGGCCGGGCGGCCTGTCCCGGCCGAGGCACAAGGTGGGGCGGTCGGCTAGGGGGGC 

PAPGGSLAPETNWGAL RDGR- 

CTCCGTCTCGTCCTCCTCGAGAACGATCCGGCTGCTCGCCCAGCGCAGGATCGGCGGCGC 

10081 + + + + + "--+ 10140 

GAGGCAGAGCAGGAGGAGCTCTTGCTAGGCCGACGAGCGGGTCGCGTCCTAGCCGCCGCG 
ETEDEELVIRSSAWRL I PPA- 

CGTCACCGAGGTGATGAGGGCGACCAGCACGATGATCGTGAAGGTCACGGTGTGCAGTAC 

10141 + + + + + + 10200 

GCAGTGGCTCCACTACTCCCGCTGGTCGTGCTACTAGCACTTCCAGTGCCACAGGTCATG 

TVSTILAVLVI ITFTVTDLV- 

GCCGATACGCAGGCCGACCAGGGCGATCACCACCTCGATCATTCCACGCGAGTTCATCCC 

10201 + + + + + + 10260 

CGGCTATGCGTCCGGCTGGTCCCGCTAGTGGTGGAGCTAGTAAGGTGCGCTCAAGTAGGG 
GI RLGVLAIVVE IMGRSNMG- 

CGCTCCGAGCGCCAGCCCCTCGTAGCGGCTCATCCCGCCACTACGGGCGGCGACGTACGC 

10261 + + + + + + 10320 

GCGAGGCTCGCGGTCGGGGAGCATCGCCGAGTAGGGCGGTGATGCCCGCCGCTGCATGCG 

AGLALGEYRSMGGS RAAVYA- 

ACCGGCGAACTTGCCGAAAGTGGCCACCAACAGCACCCCGAGGCCCGTGAGCAGCACCGA 

10321 + + + + + + 10380 

TGGCCGCTTGAACGGCTTTCACCGGTGGTTGTCGTGGGGCTCCGGGCACTCGTCGTGGCT 

GAFKGFTAVLLV.GLGTLLVS- 

CGGCTCCGCGAGTGCGGTCAGGTCCATGCGAAGCCCCACACTGCCCAGGAACACCGGTGC 

10381 + + + + + + 10440 

GCCGAGGCGCTCACGCCAGTCCAGGTACGCTTCGGGGTGTGACGGGTCCTTGTGGCCACG 

PEALATLDMRLGVSGLFVPA- 

GAACACGGCCATGACCAGCGTGCGCAGCGGGGCGAGCCGTACCGGGGCGATGTGCCTCAG 

10441 + --- + + + + + 10500 

CTTGTGCCGGTACTGGTCGCACGCGTCGCCCCGCTCGGCATGGCCCCGCTACACGGAGTC 
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CAGGGTCGCACCGGCCACGAACGCCCCGAACAACGCCTCCATCCCGGCCGCCGCGGTCAG 

10501 + + + + + 10560 

GTCCCAGCGTGGCCGGTGCTTGCGGGGCTTGTTGCGGAGGTAGGGCCGGCGGCGCCAGTC 
0 LTAGAVFAG FLAEMGAAATL- 

CGCCCCGTACAGGACGACCACGGCCACGCCGACGGTGACGGCCGATACGGGGACCCGGCT 

10561 + + + + + + 10620 

GCGGGGCATGTCCTGCTGGTGCCGGTGCGGCTGCCACTGCCGGCTATGCCCCTGGGCCGA 
0 AGYLVVVAVGVTVASVPVRS- 

GTCACCCGTACGGGACAGCCGCCTGCCGATCGGGCCGCCCACCGCACACGCCGCGGCGAC 

1062 i + + + + + + 10680 

CAGTGGGCATGCCCTGTCGGCGGACGGCTAGCCCGGCGGGTGGCGTGTGCGGCGCCGCTG 
0 DGTRSLRRGI PGGVACAAAV- 

GAAGACGGTCGTCCAGGCCATCGTGGTCAGGACCACGGGCCCCCCGGCCGCCCCACTCGC 

10681 + + + + + + 10740 

CTTCTGCCAGCAGGTCCGGTAGCACCAGTCCTGGTGCCCGGGGGGCCGGCGGGGTGAGCG 
0 FVTTWAMTTLVVPGGAAGSA- 

CAGCGCCGTCACCAGAGCGAGCAGCAGCCAGCCCACCGCGTCGTCGAACACCGCTGCCGC 

10741 + + + + + + 10800 

GTCGCGGCAGTGGTCTCGCTCGTCGTCGGTCGGGTGGCGCAGCAGCTTGTGGCGACGGCG 
0 LATVLALLLWGVADDFVAAA- 

GATGAGCAGCTGGCCGACGTTGCGGTGCGTCAGATTCAGGTCGGCGAGCGTCTTGGCGAT 

10801 + + + + + + 10860 

CTACTCGTCGACCGGCTGCAACGCCACGCAGTCTAAGTCCAGCCGCTCGCAGAACCGCTA 
0 ILLQGVNRHTLNLDALTKAI- 

CACCGGGAGGGCCGTGACACACATCGCGACCCCGAGGAACAGCGCGAAGACGCCCCGCTC 

10861 + + + + + + 10920 

GTGGCCCTCCCGGCACTGTGTGTAGCGCTGGGGCTCCTTGTCGCGCTTCTGCGGGGCGAG 
0 VPLATVCMAVGLFLAFVGRE- 

TCCGGAGTCCGCGAGCAGCGAGGCGGGCACCAGGTAGCCGGTGGCGATGCCCAGCCCCAG 

10921 + + + + + + 10980 

AGGCCTCAGGCGCTCGTCGCTCCGCCCGTGGTCCATCGGCCACCGCTACGGGTCGGGGTC 
0 GSDALLSAPVLYGTAIGLGL- 

AGGAATCAGAAGACCCGCCAGGCTGACCCGGGCGGCCAGACCCCCGCGCTTGCGCAGGAT 

10981 + + + + --- + + 11040 

TCCTTAGTCTTCTGGGCGGTCCGACTGGGCCCGCCGGTCTGGGGGCGCGAACGCGTCCTA 
0 P I LLGALSVRAALGGRKRL I- 

CCGGGGGTCGAACTGGGCACCTGCGATGGCCACCAGCAGAAGGACGCCGAACTGGCAGAA 

11041 + + + + + + IHOO 

GGCCCCCAGCTTGACCCGTGGACGCTACCGGTGGTCGTCTTCCTGCGGCTTGACCGTCTT 
0 RPDFQAGAIAVLLLVGFQCF- 

CGCGTCGAGCAGGTGCGCCTGCGAGATGTCCTCGGGAAACAGCCTGCCGGAAAGTCCCGG 

11101 + + + + + + 11160 

GCGCAGCTCGTCCACGCGGACGCTCTACAGGAGCCCTTTGTCGGACGGCCTTTCAGGGCC 

0 ADLLHAQS IDEPFLRGSLGP- 

CGAGATCTGCCCCAGCAGGGTCGGCCCGAGCAGTACCCCCGCGGTCAGCTCCCCCACCAG 

11161 + + + + + + 11220 

GCTCTAGACGGGGTCGTCCCAGCCGGGCTCGTCATGGGGGCGCCAGTCGAGGGGGTGGTC 
0 S I QGLLT PGLLVGATLEGVL- 

CGGCGGCAGACCGATCCGGGTCCCCAGCCGTCCCAGACCGTAGGCACAGGCGAGCAGGAG 

112 21 + + + + + + 11280 

GCCGCCGTCTGGCTAGGCCCAGGGGTCGGCAGGGTCTGGCATCCGTGTCCGCTCGTCCTC 
0 PPLGI RTGLRGLGYACALLL- 
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GCCGACCTGGAGCAGGAAGACCGTCAGCGGCTCCCCGCCCAGCGGCGACGTGGCTGCGAG 

11281 + + + + + + H340 

CGGCTGGACCTCGTCCTTCTGGCAGTCGCCGAGGGGCGGGTCGCCGCTGCACCGACGCTC 
GVQLLFVTLPEGGLPSTAAL- 

CACAGCCACGTCAGGACCGCGCACCGGGAACCCAGCCCAGCCCGTCCGTCGACGCGGCCA 

11341 + + + + + + 11400 

GTGTCGGTGCAGTCCTGGCGCGTGGCCCTTGGGTCGGGTCGGGCAGGCAGCTGCGCCGGT 

V A V - 

★ *SRAGPVWGLGDTSAA- 

GACCCCCCTGCCTCACCGGTCGCTCGGCCCCCGGCTCATCCCCCAGAAGAGCCCGTGCCT 

11401 + + + + + + H460 

CTGGGGGGACGGAGTGGCCAGCGAGCCGGGGGCGGAGTAGGGGGTCTTCTCGGGCACGGA 
LGGQRVPREAGAEDGLLARA 

GCAGTGCGGCGCTCTGCTCCATGAGGCGGCCCACCACCTTTCCCGGCACGGCGCCGTGCG 

11461 + + + + + + 11520 

CGTCACGCCGCGAGACGAGGTACTCCGCCGGGTGGTGGAAAGGGCCGTGCCGCGGCACGC 

QLAASQEMLRGVVKGPVAGH 

GCCCGTCGGCGTCGCCCGCAGCGGTGTGCGTCATGCCGGCCATCTCGTCGGACGCCTCGG 

11521 + + + + + + 11580 

CGGGCAGCCGCAGCGGGCGTCGCCACACGCAGTACGGCCGGTAGAGCAGCCTGCGGAGCC 

PGDADGAATHTMGAMEDSAE 

AGAACCGCTGCCTGGCCCGGGCCGTGTCGGCGAACTCGTCGGAGGAGACCCCGCCGATCA 

11581 + + + + + + H640 

TCTTGGCGACGGACCGGGCCCGGCACAGCCGCTTGAGCAGCCTCCTCTGGGGCGGGTAGT 
S FRQRARATDAFEDSSVGG I 

GTTCGACGAAGGACTGCAGGTCGGAGTCCGCGGTGTTGGAGATCTTCCGGGCCTGCCAGA 

11641 + + + + + + 11700 

CAAGCTGCTTCCTGACGTCCAGCCTCAGGCGCCACAACCTCTAGAAGGCCCGGACGGTCT 

LEVFSQLDSDATNS I KRAQW 

AATAGGAGTCCTCCGAATGGTGCATGTCGTAGAAGCCGACCAGGAACTCGTAGAAGCGGC 

11701 + + + + + + 11760 

TTATCCTCAGGAGGCTTACCACGTACAGCATCTTCGGCTGGTCCTTGAGCATCTTCGCCG 

FYSDESHHMDYFGVLFEYFR 

CGTACTCCAGCCGGTAGCGGGCCTCGAACTCCTCGAACGCGCTGGTCTCGTCGACCGACC 

11 761 + + + + + + 11820 

GCATGAGGTCGGCCATCGCCCGGAGCTTGAGGAGCTTGCGCGACCAGAGCAGCTGGCTGG 
GYELRYRAEFEEFASTEDVS 

CGTCCAGGCAGGAGTTGAGCGAGCGCGCTGCCAGCAGTCCGCTGTAGGTGGCGAGGTGCA 

11821 + + + + + + H880 

GCAGGTCCGTCCTCAACTCGCTCGCGCGACGGTCGTCAGGCGACATCCACCGCTCCACGT 
GDLCSNLSRAALLGSYTALH 

CCCCGGAGGAGAACACCGGGTCGACGAAGCACGCGGCATCCCCGACCAGGGCCATGCCCG 

11881 + + + + + + 11940 

GGGGCCTCCTCTTGTGGCCCAGCTGCTTCGTGCGCCGTAGGGGCTGGTCCCGGTACGGGC 

VGSSFVPDVFCAADGVLAMG 

GCGCCCAGAACTTCGTGTTGCTGTACGACCAGTCCTTGCGGACCCGGAGCTCGCCGTAGG 

119 41 + + + + + : + 12000 

CGCGGGTCTTGAAGCACAACGACATGCTGGTCAGGAACGCCTGGGCCTCGAGCGGCATCC 
PAWFKTNSYSWDKRVRLEGY 

GGCCCTCGGTCACCCGGGTGGCCTCGGAGAGCTTCTCCGCGATCAGCGGGCAGGCCGCGA 

12001 + + + + + + 12060 

CCGGGAGCCAGTGGGCCCACCGGAGCCTCTCGAAGAGGCGCTAGTCGCCCGTCCGGCGCT 

PGETV RTAESLKEAI LPCAA 
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TGAACGACTCCATCGCCTTCTCGGGGTCGCCCTGCACCAGGCTCGCCGAGTCCCGGTTCA 

12061 + - + + + + + 12120 

11 I FSEMAKEPDGQVLSASDRN 

CCACTGCGCCGACACTCGTCAGCTCGGGAGACAGGGGTATGTACCAGAACCACCCGTGCT 

12121 + + + + + + 12180 

GGTGACGCGGCTGTGAGCAGTCGAGCCCTCTGTCCCCATACATGGTCTTGGTGGGCACGA 
11 VVAGVSTLEPSLPIYWFWGH 

CGAAGGTGCAGGTGAAGATGTTCCCGGAGTTCGGCTTCGGAAGCCGCTTGCCGCCGTTGA 

12181 + + + + + + 12240 

GCTTCCACGTCCACTTCTACAAGGGCCTCAAGCCGAAGCCTTCGGCGAACGGCGGCAACT 
11 EFTCTFINGSNPKPLRKGGN 

AGTAGCCGAACAGGGCCAGGTTGCGGAAGAAGGGCGAGTACTCGCGCTTGGCGCCCGACT 

12241 + + + + + + 12300 

TCATCGGCTTGTCCCGGTCCAACGCCTTCTTCCCGCTCATGAGCGCGAACCGCGGGCTGA 
11 FYGFLALNRFFPSYERKAGS 

TCTTGTACAGCCCACCGGTGTTGCCGGAGGCGTCCACGACGAAACGGGAGCCCACCTCGT 

12301 + + + + + + 12360 

AGAACATGTCGGGTGGCCACAACGGCCTCCGCAGGTGCTGCTTTGCCCTCGGGTGGAGCA 
11 KKYLGGTNGSADVVFRSGVE 

GCTCGCGCCCCTCGGAGTCCCGGTAGCGCACGCCCCGCACCCGGCCGTCCTCGGCCTTGA 

12361 + - + + + + + 12420 

CGAGCGCGGGGAGCCTCAGGGCCATCGCGTGCGGGGCGTGGGCCGGCAGGAGCCGGAACT 
11 HERGESDRYRVGRVRGDEAK 

GCACGTCGAGGACATCGCTGTTCTCCCGCACCTCGACACCGTGCCTGCGAGCGTTGTCGA 

12421 + + + + + + 12480 

CGTGCAGCTCCTGTAGCGACAAGAGGGCGTGGAGCTGTGGCACGGACGCTCGCAACAGCT 
11 LVDLVDSNERVEVGHRRAND 

GCAGGATCTGGTCGAACTTCATGCGCTCGACCTGGTACGCGTACCCCGTCGCCCCCGGCA 

12481 + + + + + + 12540 

CGTCCTAGACCAGCTTGAAGTACGCGAGCTGGACCATGCGCATGGGGCAGCGGGGGCCGT 
11 LL I QDFKMREVQYAYGTAGP 

TCCGGCGCGAGACGGCGAAGTCGAACGTCCACGGTTCGGGGTTGGCACCCCACTTGAACG 

12541 + + + + + + 12600 

AGGCCGCGCTCTGCCGCTTCAGCTTGCAGGTGCCAAGCCCCAACCGTGGGGTGAACTTGC 
11 MRRSVAFDFTWPEPNAGWKF 

TCCCGCCGTGCTTGATCGTGAAGGCTGCCTTCTTCAGCTCGTCGGAGACACCGAGGAGGT 

12601 + + + + + + 12660 

AGGGCGGCACGAACTAGCACTTCCGACGGAAGAAGTCGAGCAGCCTCTGTGGCTCCTCCA 
11 TGGHKITFAAKKLEDSVGLL 

GTGCGATGCCGTGGACGGTGGAGGGGAGGAGCGACTCACCGATCTGGTAGCGCGGGAAGG 

12661 + - - + + + + + 12720 

CACGCTACGGCACCTGCCACCTCCCCTCCTCGCTGAGTGGCTAGACCATCGCGCCCTTCC 
11 HAIGHVTSPLLSEGIQYRPF 

TCTCCTTCTCCAGCTGGAGTACGCGATGGCCCCGCTTGCGGACCAGCGTGGAGACGGTCG 

12721 + + + + + + 12780 

AGAGGAAGAGGTCGACCTCATGCGCTACCGGGGCGAACGCCTGGTCGCACCTCTGCCAGC 
11 TEK ELQLVRHGRKRVLTSVT 

AGCCCGCCGGACCTCCGCCGACCACGATGACGTCGTACTGCGCTGACACGTCCACGGACT 

12781 + - + + + + + 12840 

TCGGGCGGCCTGGAGGCGGCTGGTGCTACTGCAGCATGACGCGACTGTGCAGGTGCCTGA 
ll-< SGAPGGGVVIVDYQASVDM 

CTCCTTCTCGCACATCGGGCGTCTCATATTCCCAGGAATCCTCTGGCCCGCCCAGGTGCT 
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12841 + + + + + + 12900 

GAGGAAGAGCGTGTAGCCCGCAGAGTATAAGGGTCCTTAGGAGACCGGGCGGGTCCACGA 

GCCGCATCTTCGGTATTGCGAAGTCGTGGGCATTCTGCGAGAAGCATGAACCGCGTGGCC 

X2901 + + + + + "- + 12960 

CGGCGTAGAAGCCATAACGCTTCAGCACCCGTAAGACGCTCTTCGTACTTGGCGCACCGG 

CGGTCTACAGTGGCGTGGAATTTCAGTGATTGCGCTGAAGGGCGGCACACGATGAAGGCA 

12 961 + + + + + + 13020 

GCCAGATGTCACCGCACCTTAAAGTCACTAACGCGACTTCCCGCCGTGTGCTACTTCCGT 

10-> M K A - 

CTTGTACTGTCGGGTGGTTCGGGGACCCGCCTGCGCCCGATCAGTTACGCCATGCCGAAG 

13021 + + + + + + 13080 

GAACATGACAGCCCACCAAGCCCCTGGGCGGACGCGGGCTAGTCAATGCGGTACGGCTTC 

10 LVLSGGSGTRLRPI SYAMPK 

CAGCTCGTTCCGATCGCCGGGAAGCCAGTCCTTGAATATGTTCTGGATAATATCCGGAAC 

13081 + + + + + + 13140 

GTCGAGCAAGGCTAGCGGCCCTTCGGTCAGGAACTTATACAAGACCTATTATAGGCCTTG 

10 QLVP IAGKPVLEYVLDNIRN 

CTCGATATCAAAGAGGTCGCCATTGTCGTCGGTGACTGGGCTCAGGAAATTATTGAGGCA 

13141 + + + + + + 13200 

GAGCTATAGTTTCTCCAGCGGTAACAGCAGCCACTGACCCGAGTCCTTTAATAACTCCGT 

10 LD IKEVAIVVGDWAQE I I EA 

ATGGGTGACGGCAGCCGTTTCGGTCTGCGCCTCACCTACATACGCCAGGAGCAACCTCTG 

13201 + + + + + + 13260 

TACCCACTGCCGTCGGCAAAGCCAGACGCGGAGTGGATGTATGCGGTCCTCGTTGGAGAC 
10 MGDGSRFGLRLTYIRQEQPL 

GGCATCGCGCACTGCGTGAAACTGGCCCGAGACTTCCTCGACGAGGACGACTTCGTCCTC 

12261 + + + + + + 13320 

CCGTAGCGCGTGACGCACTTTGACCGGGCTCTGAAGGAGCTGCTCCTGCTGAAGCAGGAG 
10 GIAHCVKLARDFLDEDDFVL 

TACCTAGGCGACATCATGCTGGACGGAGACCTGTCCGCGCAGGCGGGGCACTTCCTCCAC 

1332 i + + + + + + 13380 

ATGGATCCGCTGTAGTACGACCTGCCTCTGGACAGGCGCGTCCGCCCCGTGAAGGAGGTG 
10 YLGDIMLDGDLSAQAGHFLH 

ACCCGCCCCGCCGCGCGGATCGTCGTGCGCCAGGTGCCCGACCCCCGGGCCTTCGGGGTG 

13381 + + + + + + 13440 

TGGGCGGGGCGGCGCGCCTAGCAGCACGCGGTCCACGGGCTGGGGGCCCGGAAGCCCCAC 
10 TRPAARIVVRQVPDPRAFGV 

ATCGAGCTGGACGGCGAAGGGCGTGTGCTGCGCCTGGTCGAGAAACCCCGTGAACCGCGC 

13441 + + + + + + 13500 

TAGCTCGACCTGCCGCTTCCCGCACACGACGCGGACCAGCTCTTTGGGGCACTTGGCGCG 

10 IELDGEGRVLRLVEKPREPR 

AGCGACCTCGCGGCGGTCGGCGTGTACTTCTTCACCGCGGACGTGCACCGCGCCGTCGAC 

13501 + + + + + + 13560 

TCGCTGGAGCGCCGCCAGCCGCACATGAAGAAGTGGCGCCTGCACGTGGCGCGGCAGCTG 
10 SDLAAVGVYFFTADVHRAVD 

GCGATTAGCCCGAGCCGACGGGGCGAGCTGGAAATCACCGACGCCATCCAGTGGCTGCTG 

13561 + + + + + + 13620 

CGCTAATCGGGCTCGGCTGCCCCGCTCGACCTTTAGTGGCTGCGGTAGGTCACCGACGAC 

10 AISPSRRGELEITDAIQWLL 

GAGCAGGGCCTGCCGGTCGAGGCCGGCCGCTACACGGACTACTGGAAGGACACCGGCCGG 

12621 + + + + + + 13680 

CTCGTCCCGGACGGCCAGCTCCGGCCGGCGATGTGCCTGATGACCTTCCTGTGGCCGGCC 
10 EQGLPVEAGRYTDYWKDTGR - 
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GTCGAGGACGTCGTGGAGTGCAACCGGCGGATGCTCGGCCGTCTGGCGCTCCAGGTGTCG 

13681 + + + + + + 13740 

CAGCTCCTGCAGCACCTCACGTTGGCCGCCTACGAGCCGGCAGACCGCGAGGTCCACAGC 
10 VEDVVECNRRMLGRLALQVS 

GGCGAGGTGGACCCGGAGAGCGAACTGGTGGGTGCGGTGGTCGTCGAGGAGGGCGCCCGG 

13741 + + + + + + 13800 

CCGCTCCACCTGGGCCTCTCGCTTGACCACCCACGCCACCAGCAGCTCCTCCCGCGGGCC 
10 GEVDPESELVGAVVVEEGAR 

GTGACGCGTTCGCGGGTCGTGGGACCAGCGGTGATCGGCGCGGGCACGGTCGTCGAGGAC 

13801 + + + + + + 13860 

CACTGCGCAAGCGCCCAGCACCCTGGTCGCCACTAGCCGCGCCCGTGCCAGCAGCTCCTG 
10 VTRSRVVGPAVIGAGTVVED 

AGCCAGATCGGACCGTACGCCTCCATCGGCCGGCGCTGCACCGTGCGGGCGTCCCGGCTC 

13861 + + + + + + 13920 

TCGGTCTAGCCTGGCATGCGGAGGTAGCCGGCCGCGACGTGGCACGCCCGCAGGGCCGAG 
10 SQIGPYAS IGRRCTVRASRL 

TCCGACTCCATCGTCCTTGACGACGCCTCGATCCTCGCGGTGAGCGGACTGCACGGCTCG 

13921 + + + + + + 13980 

AGGCTGAGGTAGCAGGAACTGCTGCGGAGCTAGGAGCGCCACTCGCCTGACGTGCCGAGC 
10 S DSIVLDDAS ILAVSGLHGS 

CTGATCGGAAGGGGCGCGCGGATCGCGCCCGGGGCCCGGGGCGAGGCCCGGCACCGGCTG 

13981 + + + + + + 14040 

GACTAGCCTTCCCCGCGCGCCTAGCGCGGGCCCCGGGCCCCGCTCCGGGCCGTGGCCGAC 
10 LIGRGARIAPGARGEARHRL 

GTCGTCGGCGACCACGTGCAGATCGAGATCGCGGCCTGACGCACCCACCGGAGCACCGGG 

14041 + + + + + + 14100 

CAGCAGCCGCTGGTGCACGTCTAGCTCTAGCGCCGGACTGCGTGGGTGGCCTCGTGGCCC 
10-* VVGDHVQI E I A A * - 

GGGAGGCTCGGCAGGGGCGTCAGGCCGTAAGAAGGGCTGCCGGGGCGGGACGGACCCGCC 

14101 + + + + - + + 14160 

CCCTCCGAGCCGTCCCCGCAGTCCGGCATTCTTCCCGACGGCCCCGCCCTGCCTGGGCGG 

CCGGCAGCCCACAGGTCCCCGGTCCGCGGATATGGGGGACTCGAGGTTCGATCAGCCGAA 

14161 + + + + + + 14220 

GGCCGTCGGGTGTCCAGGGGCCAGGCGCCTATACCCCCTGAGCTCCAAGCTAGTCGGCTT 
9-* * G F - 

GGTCAGAGCCACGTGGCCGAGGTCGAGCCCGGAGTTGCCGGCGCCGAGGTTACAGGCGGC 

14221 + + + + + + 14280 

CCAGTCTCGGTGC7VCCGGCTCCAGCTCGGGCCTCAACGGCCGCGGCTCCAATGTCCGCCG 
9 TLAVHGLDLGSNGAGLNCAA- 

CGTGGCGCAGTCGACGCTGCCGACCGGCGTGCCTTCGGGCGTGGAGCCCGTGTACGACTT 

14281 + + + + + + 14340 

GCACCGCGTCAGCTGCGACGGCTGGCCGCACGGAAGCCCGCACCTCGGGCACATGCTGAA 
9 TACDVSGVPTGEPTSGTYSK- 

GCGCACGACGAAGCTGAACGACGCCGCTCCGGACGCGTCCGTGGTGAAGGACGTCGCGGT 

14341 + + + + + + 14400 

CGCGTGCTGCTTCGACTTGCTGCGGCGAGGCCTGCGCAGGCACCACTTCCTGCAGCGCCA 
9 RVVFS FSAAGSADTTFSTAT- 

CGCCGGGTTGCACGCGTCCTGGCCACCGACCGGAGCGCACTGGGCGATGTAGTAGGTCTC 

14401 + + + + + + 14460 

GCGGCCCAACGTGCGCAGGACCGGTGGCTGGCCTCGCGTGACCCGCTACATCATCCAGAG 
9 APNCADQGGVPACQAIYYTE- 



GCCGGCGGCGGCACCGCTGACCGACACCGACACGCTCTGTCCGTCACTCAGACCCGAGGC 
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14461 + + + + + + 14520 

CGGCCGCCGCCGTGGCGACTGGCTGTGGCTGTGCGAGACAGGCAGTGAGTCTGGGCTCCG 
9 GAAAGSVSVSVS QGDS LGSA- 

GGGACTGACGGAGAAGGCGGGCGCGGCGAAGGCGACGGACTGTGCGGCGGCGGCCAGGCC 

14521 + + + + - + -"■ — + 14580 

CCCTGACTGCCTCTTCCGCCCGCGCCGCTTCCGCTGCCTGACACGCCGCCGCCGGTCCGG 
9 PSVS FAPAAFAVSQAAAALG- 

GATGGATGCGACGGCCACGACGCCGAACCTGGAAGCACGGCGGGACATGTGACGTAACGA 

14581 + + + + + + 14640 

CTACCTACGCTGCCGGTGCTGCGGCTTGGACCTTCGTGCCGCCCTGTACACTGCATTGCT 
9 I SAVAVVGFRSARRSMHRLS- 

CATGCGTAGGCTCCGATTCGAGGAGGGGGTTGATCACTCCATGAAAGGATCACCTCGCCG 

14641 + + + + + + 14700 

GTACGCATCCGAGGCTAAGCTCCTCCCCCAACTAGTGAGGTACTTTCCTAGTGGAGCGGC 

9-< M - 



8-' 



* R A 

GACGGCCGCCTGCATCTCCCTCTGTGCTCTCGTGGATTTCCGGCACGGCACTCCCGTCGA 

14701 + + + + + + 14760 

CTGCCGGCGGACGTAGAGGGAGACACGAGAGCACCTAAAGGCCGTGCCGTGAGGGCAGCT 
PRGGADGETS EH I E PVAS GD 

CGGCCGCCCGCAGAATGCGGCAGACCCCCCGCACCTCCTCCGGCCCCACCGCCGTACCGG 

14761 + + + " + + + 14820 

GCCGGCGGGCGTCTTACGCCGTCTGGGGGGCGTGGAGGAGGCCGGGGTGGCGGCATGGCC 
VAARLIRCVGRVEEPGVATG 

TGGGCAGCGACAGCACCCGCTCGGTGAGCGCCTCCACCTTCGGGAGCGGATCGGGCGCGT 

14821 + + + + + + 14880 

ACCCGTCGCTGTCGTGGGCGAGCCACTCGCGGAGGTGGAAGCCCTCGCCTAGCCCGCGCA 
TPLSLVRETLAEVKPLPDPA 

GGCGCGCGAGGTCGGACCGGTAGGGCTCGCAGCTGTGGCAGCCGGGGCTGAAGTAGGCGC 

14881 + + + + + + 14940 

CCGCGCGCTCCAGCCTGGCCATCCCGAGCGTCGACACCGTCGGCCCCGACTTCATCCGCG 
HRALDSRYPECSHCGPSFYA 

GGGCCAGGACGTTGTGCCGTTGGAGCACCGCCTGGAGTTCGTCGCGGTGCAGCCCGGCGC 

14941 + + + + + --+ 15000 

CCCGGTCCTGCAACACGGCAACCTCGTGGCGGACCTCAAGCAGCGCCACGTCGGGCCGCG 
RALVNHRQLVAQLEDRHLGA 

GGACGGCGTCCACCTCGATGACGACGTACTGGCAGTTCGACAGCTCGTTCGGATCCTGCG 

15001 + - + + + + + 15060 

CCTGCCGCAGGTGGAGCTACTGCTGCATGACCGTCAAGCTGTCGAGCAAGCCTAGGACGC 
RVADVEIVVYQCNSLENPDQ 

GGCGGACCCGGACGCCGGGCAGTCCGTCGAGGTACTGCTCGTACAGACGGTAGTTGCGCC 

15061 + + + + + + 15120 

CCGCCTGGGCCTGCGGCCCGTCAGGCAGCTCCATGACGAGCATGTCTGCCATCAACGCGG 
PRVRVGPLGDLYQEYLRYNR 

GGTTGATCGCGGTGAAGTGATCGGCGGACTCCAGGGAGGTGAGGCCCATGGCCGCGCTGA 

15121 + + + + + - + 15180 

CCAACTAGCGCCACTTCACTAGCCGCCTGAGGTCCCTCCACTCCGGGTACCGGCGCGACT 
RNIATFHDASELSTLGMAAS 

TCTCGTGCATCCGCGCGACCGTTCCGCTCCCGGTGATCTCATGCGCGGCGTTGAGCCCCT 

15181 + -- -- + + + + + 15240 

AGAGCACGTAGGCGCGCTGGCAAGGCGAGGGCCACTAGAGTACGCGCCGCAACTCGGGGA 
I EHMRAVTGSGTI EHAANLG 

GGTGGCGCATGGCCCGGAGCCGGTCGGCCAGGGCGTCGTCGTCGGTGACGATCGCCCCGC 
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15241 + + + + + + 15300 

CCACCGCGTACCGGGCCTCGGCCAGCCGGTCCCGCAGCAGCAGCCACTGCTAGCGGGGCG 
8 QHRMARLRDALADDDTVIAG 

CCTCGAAGCTGTTCACGAACTTCGTCGCCTGGAAGCTGAAGATCTCCGCCGTGCCGAAGC 

15301 + + + + + + 15360 

GGAGCTTCGACAAGTGCTTGAAGCAGCGGACCTTCGACTTCTAGAGGCGGCACGGCTTCG 
8 GEFSNVFKTAQFSFIEATGF 

CGCCGATCGGCTTCGACCGGTAGGTGCAGCCGAAGGCGTGGGCGGCATCGAAGAGCAGGT 

15361 + + + + + + 15420 

GCGGCTAGCCGAAGCTGGCCATCCACGTCGGCTTCCGCACCCGCCGTAGCTTCTCGTCCA 
8 GGIPKSRYTCGFAHAADFLL 

GCAGCCCGTGCTCGGCGGCCAGCTTGGTCAGCTCGTCGATCCGGGCCGGTCTGCCGAAGA 

15421 + + + + + + 15480 

CGTCGGGCACGAGCCGCCGGTCGAACCAGTCGAGCAGCTAGGCCCGGCCAGACGGCTTCT 
8 HLGHEAALKTLED IRAPRGF 

CGTGCACGTCCAGGATGGCGCGGGTACGCGGGCCGATGAGCCGCTCCACGTGTGCCACGT 

15481 + + + + + + 15540 

GCACGTGCAGGTCCTACCGCGCCCATGCGCCCGGCTACTCGGCGAGGTGCACACGGTGCA 
8 VHVDL IARTRPGI LREVHAV 

CCGCGGTTCCGGTCTCCTCGTCCAGTTCGCAGAAGACAGGCACCGCACCGATCCAGTCCA 

15541 + + + + + + 15600 

GGCGCCAAGGCCAGAGGAGCAGGTCAAGCGTCTTCTGTCCGTGGCGTGGCTAGGTCAGGT 
8 DATGTEEDLECFVPVAGIWD 

GTGCGTGGGCGGTGGCGACCCAGGTGAAGGAGGGCACGATCACCTCGTCCCCAGGACCGA 

15601 + + + + + + 15660 

CACGCACCCGCCACCGCTGGGTCCACTTCCTCCCGTGCTAGTGGAGCAGGGGTCCTGGCT 
8 LAHATAVWTFS PVIVEDGPG 

TGCCCAGGGCCTTCGCGGCGACCTGGATGCCGGTGGTGGCGTTCGATACGGCGACGCAGT 

15661 + + + + + + 15720 

ACGGGTCCCGGAAGCGCCGCTGGACCTACGGCCACCACCGCAAGCTATGCCGCTGCGTCA 
8 I GLAKAAVQ I GTTANSVAVC 

GCCTGACCTGGGTCAGCTCGGCCACACGGGCCTCGAACTCCCGGACCAGGGGGCCGTCAT 

15721 + + + + + + 15780 

CGGACTGGACCCAGTCGAGCCGGTGTGCCCGGAGCTTGAGGGCCTGGTCCCCCGGCAGTA 
8 HRVQTLEAVRAEFERVL PGD 

TGGTGAACCACAGGCGCTCCAGCGCCCCGTCGATCCGTTCCATCAAACGGTCGCGGGAGC 

15781 + + + + + + 15840 

ACCACTTGGTGTCCGCGAGGTCGCGGGGCAGCTAGGCAAGGTAGTTTGCCAGCGCCCTCG 
8 N. TFWLRELAGDIREMLRDRS 

CCACGTTCGGGCGTCCCACGTGCAGCGGTTCGCTGAAGTAGGGCGTGGGTAGGGAGTCCA 

15841 + + + + + + 15900 

GGTGCAAGCCCGCAGGGTGCACGTCGCCAAGCGACTTCATCCCGCACCCATCCCTCAGGT 
8 GVNPRGVHLPESFYPTPLSD 

GACGCACCGGGCCGCCGCTCATGCCGTGCGCACGCCGACGAAGAGGCCGGGGCTGTTGGG 

15901 + +■ + + + + 15960 

CTGCGTGGCCCGGCGGCGAGTACGGCACGCGTGCGGCTGCTTCTCCGGCCCCGACAACCC 
a DAPGRRS CRAHADEEAGAVG 

b THRAAAHAVRTPTKRPGLLG- 
C RTGP PLMPCARRRRGRGCWA- 

15901 + + + + + + 15960 

7- * *ATRVGVFLGPSNP- 

8- < LRVPGGSM 

CCGGCCGTCGGCCAGCCGGAAGCCGGGCACGAACCGCACCGAGAGCCCCACCGATTCGPlA 
15961 + + + + + + 16020 
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GGCCGGCAGCCGGTCGGCCTTCGGCCCGTGCTTGGCGTGGCTCTCGGGGTGGCTAAGCTT 
7 RGDALRFGPVFRVS LGVS E F - 

GGCGTCGGTGTACTGCTCGCGGGTGAAGAGGCTGGAGGTCAGGACCTCGGAGAACTCTCT 

16021 + + + + + + 16080 

CCGCAGCCACATGACGAGCGCCCACTTCTCCGACCTCCAGTCCTGGAGCCTCTTGAGAGA 
7 ADTYQERTFLSSTLVESFER- 

GAAGCCGGAGGCGTCCGCGACCCGGAACCGGACCTCCAGACGTGACTTGTCGCCCTGGCG 

16081 + + + + + + 16140 

CTTCGGCCTCCGCAGGCGCTGGGCCTTGGCCTGGAGGTCTGCACTGAACAGCGGGACCGC 
7 FGSADAVRFRVELRS KDGQR- 

CACGGAGTGCGTCATCCGCGTGATGACACGGCCCTCCTCCTGGTGCAGATGGCCGCCGAC 

16141 + + + + + + 16200 

GTGCCTCACGCAGTAGGCGCACTACTGTGCCGGGAGGAGGACCACGTCTACCGGCGGCTG 
7 VSHTMRT IVRGEEQHLHGGV- 

ATGCCCGTCGAGGAAGTTCTCGGGGAAATACCAGGGTTCGGCGACGAGGACTCCCCCGGG 

16201 + + + + + + 16260 

TACGGGCAGCTCCTTCAAGAGCCCCTTTATGGTCCCAAGCCGCTGCTCCTGAGGGGGCCC 
7 HGDLFNE P FYWPEAVLVGGP- 

GTTCAGGTGGTGGGCCATGGCCGACACCGCGGCCTTGAGCTCGGTGACGGACCCCATCTC 

16261 + + + + + + 16320 

CAAGTCCACCACCCGGTACCGGCTGTGGCGCCGGAACTCGAGCCACTGCCTGGGGTAGAG 
7 NLHHAMA S VAAKL ETVS GM E - 

GCCGAGCGCGTTGCCCATGCAGGTGATCGCGTCGAAGGTGCGGCCCAGGTCGAACGAACG 

16321 + + + + + + 16380 

CGGCTCGCGCAACGGGTACGTCCACTAGCGCAGCTTCCACGCCGGGTCCAGCTTGCTTGC 
7 GLANGMCT IADFTRGLDFS R - 

CATGTCACCGGCGTGCAGCGGGACGCCGGGAAGCCGGCCCGCCGCCTGCTCCAGCATCGC 

16381 + + + + + + 16440 

GTACAGTGGCCGCACGTCGCCCTGCGGCCCTTCGGCCGGGCGGCGGACGAGGTCGTAGCG 
7 MDGAHL PVG PLRGAAQE L M A - 

GGGCGCGTACTCGAGGCCCTCCACATGGCCGAAGAGCGTGGCGAGCGTCTCCAGATGGGC 

16441 + + + + + + 16500 

CCCGCGCATGAGCTCCGGGAGGTGTACCGGCTTCTCGCACCGCTCGCAGAGGTCTACCCG 
7 PAYELGEVHGFLTALTELHA- 

TCCGGTGCCGCAGGCGACGTCCAGGAGCGACACGGCGTCGGGGCGGGCGGCGAGGATCAG 

16501 + + + + + + 16560 

AGGCCACGGCGTCCGCTGCAGGTCCTCGCTGTGCCGCAGCCCCGCCCGCCGCTCCTAGTC 
7 GTGCAVDLLSVADPRAALIL- 

CTCGGTGAGCCCGCGGGCCTCCAGGTCGAAGTCCTTGCCGCGGCTGCGGAACACGAGGTC 

16561 + + + + + + 16620 

GAGCCACTCGGGCGCCCGGAGGTCCAGCTTCAGGAACGGCGCCGACGCCTTGTGCTCCAG 
7 ETLGRAELDFDKGRSRFVLD- 

GTAGAACTTCGCGTGCTCGGGGCCGTACTCCATCAGACGAGCTCCTTCGCAGACTGGGCG 

16621 + + + + + + 16680 

CATCTTGAAGCGCACGAGCCCCGGCATGAGGTAGTCTGCTCGAGGAAGCGTCTGACCCGC 
7-< YFKAHEPGYEM- 

6-* *VLEKASQA- 

GAGATGATTCTGGGCTCCGGGATGGGAACGATGAACTTCCCTCCCGCCTCCAGGAAGCGG 

16681 + + + + + + 16740 

CTCTACTAAGACCCGAGGCCCTACCCTTGCTACTTGAAGGGAGGGCGGAGGTCCTTCGCC 
6 S I IRPEPI PVI FKGGAELFR- 

CGCTCCTTGCGGACGACCTCGTCGGTGTAGTTCCAGGCGAGGAGGAGGTAGTAGTCCGGC 
16741 + + + + + + 16800 
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GCGAGGAACGCCTGCTGGAGCAGCCACATCAAGGTCCGCTCCTCCTCCATCATCAGGCCG 
REKRVVEDTYNWALLLYYDP- 

TCGGTGGCAGCGACCTCCTCCGGAGGAAGGACCGGGATGCGGTTCCCCGGCAGCAGTTTG 

16801 + + + + + + 16860 

AGCCACCGTCGCTGGAGGAGGCCTCCTTCCTGGCCCTACGCCAAGGGGCCGTCGTCAAAC 
ETAAVEE PPLVPIRNGPLLK- 

CCGTGCTTGAGGCTGGTGGTGTCGCCGCAGACGGTGATGTCCTGATCCGTCAGACCGCAG 

16861 + + + + + + 16920 

GGCACGAACTCCGACCACCACAGCGGCGTCTGCCACTACAGGACTAGGCAGTCTGGCGTC 
GHKLSTTDGCVTIDQDTLGC- 

GCCATCAGCAACTGGGTCCCCTTGGACGGTGCTCCGTAGCCGGCCACGCGGTGGCCGTCC 

16921 + + + + + + 16980 

CGGTAGTCGTTGACCCAGGGGAACCTGCCACGAGGCATCGGCCGGTGCGCCACCGGCAGG 
AMLLQTGKSPAGYGAVRHGD - 

GCGGCCAGACCGCGAACGAGCGTACGGATCGCTTCGGTCACGCGCGTCACCCGCTCGGCG 

16981 + + + + + + 17040 

CGCCGGTCTGGCGCTTGCTCGCATGCCTAGCGAAGCCAGTGCGCGCAGTGGGCGAGCCGC 
AALGRVLTRIAETVRTVREA- 

AACGCCCGGTAGGGGGCATCCGTCAGCAGTCCGCGCTCCTCCTCCAGGCCGAGCAGCGCC 

17041 + + + + + + 17100 

TTGCGGGCCATCCCCCGTAGGCAGTCGTCAGGCGCGAGGAGGAGGTCCGGCTCGTCGCGG 
F ARYPADTLLGREEELGLLA- 

GCGACCGAGGGCTCCGGGACCCGTGCGGCCGACTCGCGCGCGGCGACGACCGCGATCGAA 

17101 + + + + + + 17160 

CGCTGGCTCCCGAGGCCCTGGGCACGCCGGCTGAGCGCGCGCCGCTGCTGGCGCTAGCTT 
AVS PEPVRAASERAAVVAI S - 

CCGCCGTGCACGGCGACCCGCTCCACGTCGATGATCCGCAGGCCGTGCGCGCCGAAGAGG 

17161 + + + + + + 17220 

GGCGGCACGTGCCGCTGGGCGAGGTGCAGCTACTAGGCGTCCGGCACGCGCGGCTTCTCC 
GGHVAVREVD I IRLGHAGFL- 

TGGCGCAGTGTGTGCAGGGAGAAGTACGACAGGTGCTCGTGGTAGATCGTGTCGAACTGG 

17221 + + + + + + 17280 

ACCGCGTCACACACGTCCCTCTTCATGCTGTCCACGAGCACCATCTAGCACAGCTTGACC 
HRLTHLSFYSLHEHYITDFQ- 

TTCTCGTCGAGCAGGTTCAGCAGGTACGGCACCTCGATGACCAGGACGCCGTCGTCGTCG 

17281 + + + + + + 17340 

T^AGAGCAGCTCGTCCAAGTCGTCCATGCCGTGGAGCTACTGGTCCTGCGGCAGCAGCAGC 
NEDLLNLLYPVEIVLVGDDD- 

AGCACTGCGTCGACGCCGTCCAGGATGCGGTGCACGTCGTCGATGTGCGCGAAGCACTGG 

17341 + + + -f + + 17400 

TCGTGACGCAGCTGCGGCAGGTCCTACGCCACGTGCAGCAGCTACACGCGCTTCGTGACC 
LVADVGDLI RHVDD IHAFCQ 

CGGCCGATGACGGCCTTGGCCCTGCCCTGCTCAAGGGCGATGCGGCCCGCGGGCTCCGGG 

17401 + + + + + + 17460 

GCCGGCTACTGCCGGAACCGGGACGGGACGAGTTCCCGCTACGCCGGGCGCCCGAGGCCC 
RGIVAKARGQELAI RGAPEP- 

CCGAAGAAGTCCGGGTCCGTGGGGATCCCCCGGGCGTTGGCGATCTCGGCGAGGTTGGCC 

17461 + + + + + + 17520 

GGCTTCTTCAGGCCCAGGCACCCCTAGGGGGCCCGCAACCGCTAGAGCCGCTCGAACCGG 
GFFDPDTPIGRANAIEALNA- 

GCCGGGTCGACCCCGGCCACCCGCATGCCCGCCGCCCGGAACATCGCGAGCTGGGTGCCG 

17521 + + + + + + 17580 

CGGCCCAGCTGGGGCCGGTGGGCGTACGGGCGGCGGGCCTTGTAGCGCTCGACCCACGGC 
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6 APDVGAVRMGAARFMALQTG- 

ACGTTGCTGCCCAGCTCCACGACCAGGTCGCCGGAGGCGAGGCTTGCCCGGCGGGTCGCC 

17581 + + + + + + 17640 

TGCAACGACGGGTCGAGGTGCTGGTCCAGCGGCCTCCGCTCCGAACGGGCCGCCCAGCGG 
6 VNSGLEVVLDGSALSARR TA- 

AGCCCGACGATGTGCGCCATGTGCTCGCGGATCTGGTCGGAGTCGGAGGAGACGTAGACG 

17641 + + + + + + 17700 

TCGGGCTGCTACACGCGGTACACGAGCGCCTAGACCAGCCTCAGCCTCCTCTGCATCTGC 
6 LGVIHAMHERIQDSDSSVYV- 

TAGTGCTTGAACAGTGTCCCGGGGTCGACGACATGGCGAAGCGTCATCAGCCGGCACGAC 

17701 + + + + + + 17760 

ATCACGAACTTGTCACAGGGCCCCAGCTGCTGTACCGCTTCGCAGTAGTCGGCCGTGCTG 
6 YHKFLTGPDVVHRLTMLRCS- 

CGGCACACGATGACGTCGAGCGGGAAGACGTCCTGCGCCTCATCGGCGTCGGCCGGATCG 

17761 + + + + + + 17820 

GCCGTGTGCTACTGCAGCTCGCCCTTCTGCAGGACGCGGAGTAGCCGCAGCCGGCCTAGC 
6 RCVIVDLPFVDQAEDADAPD 

ACGAACCCGTTGGCCAGCGGCAGCGAGCCGAAGGAGATCACCTCGGTCCAGTCGTCCGCA 

17821 + + + + + + 17880 

TGCTTGGGCAACCGGTCGCCGTCGCTCGGCTTCCTCTAGTGGAGCCAGGTCAGCAGGCGT 
6 VFGNALPLSGFS IVETWDDA- 

CCGCATACACGGCACGTCTCGTCCCGCCTGCATTTCTCCAGCATGAAGTCTCCTGACGGC 

17881 + + + + + + 17940 

GGCGTATGTGCCGTGCAGAGCAGGGCGGACGTAAAGAGGTCGTACTTCAGAGGACTGCCG 
6-< GCVRCTE DRRCKE L M - 

GAATGCCGACGCATCGGGCCCGTCGGTCCGGGGACGGTCAATCTAGGGTTCCGGCCGACG 

17941 + + + + + + 18000 

CTTACGGCTGCGTAGCCCGGGCAGCCAGGCCCCTGCCAGTTAGATCCCAAGGCCGGCTGC 

GGCGCTCCACTTCGTATGTGCCCTACTGGTTCAGCGGAGCGGACGGGTGAACGCCCGTAC 

18001 + + + + + + 18060 

CCGCGAGGTGAAGCATACACGGGATGACCAAGTCGCCTCGCCTGCCCACTTGCGGGCATG 
17-* * RL PRTFARV- 

GTCCTCGATGAGGAGCTGCGGCTGCTCCATGGCCGCGAAGTGCCCGCCGCGGTCGAACTC 

18061 + + + + + + 18120 

CAGGAGCTACTCCTCGACGCCGACGAGGTACCGGCGCTTCACGGGCGGCGCCAGCTTGAG 
17 DE I LLQPQEMAAFHGGRD F E - 

GGTCCACCGCGTCAGGGTCGGCAGGATGCCCTCGGCGAACGACCGGATCGGCCGGGTGGC 

18121 + + -- + + + + 18180 

CCAGGTGGCGCAGTCCCAGCCGTCCTACGGGAGCCGCTTGCTGGCCTAGCCGGCCCACCG 
17 TWRTLTPLIGEAFSRI PRTA- 

GTCGTCCGGGAACACCGCGACGCCGACGGGGGCCGTCAGCGGCCAGGGCCCGCCCCAGGT 

18181 + + + + + + 18240 

CAGCAGGCCCTTGTGGCGCTGCGGCTGCCCCCGGCAGTCGCCGGTCCCGGGCGGGGTCCA 
17 DDPFVAVGVPATL PWPGGWT- 

GCGGGCGAAGTCCGCCATGCCGCGAGCCGACTCGTAGTACAACTGAGCGCTGGAACCGGC 

18241 + + + + + + 18300 

CGCCCGCTTCAGGCGGTACGGCGCTCGGCTGAGCATCATGTTGACTCGCGACCTTGGCCG 
17 RAFDAMGRASEYYLQAS SGA- 

CGTCGCGGTCAGCCAGTAGATCATCACGTGGGTGAGCAGCCGGTCCCGGGAGATGGCCTC 

18301 + + + + + + 18360 

GCAGCGCCAGTCGGTCATCTAGTAGTGCACCCACTCGTCGGCCAGGGCCCTCTACCGGAG 
17 TATLWYIMVHTLLRDRS I A E - 
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CTCCACGTTCTTGCCGCCGCTCCACTCCTGGAACTTGTCGAGAATCCAGGCGAGCTGGCC 

18361 + + + + + + 18420 

GAGGTGCAAGAACGGCGGCGAGGTGAGGACCTTGAACAGCTCTTAGGTCCGCTCGACCGG 
17 EVNKGGSWEQFKDL I WALQG- 

GACCGGGGAGTCGGTGAGGCCGTAGGCCAGGGTCTGCGGGCGGGTGGCCTGGATGCGCTG 

18421 + + + + + + 18480 

CTGGCCCCTCAGCCACTCCGGCATCCGGTCCCAGACGCCCGCCCACCGGACCTACGCGAC 
17 VPSDTLGYALTQPRTAQ I R Q - 

CCAGCCGATGCCGGTGTCGGCGAACTCCCCGCTGTGCGCCAGCTTGCCCAGGTCGCTCTC 

18481 + + + + + + 18540 

GGTCGGCTACGGCCACAGCCGCTTGAGGGGCGACACGCGGTCGAACGGGTCCAGCGAGAG 
17 W GI GTDAFEGSHALKGLDSE- 

GTCCAGGCGCCCGATGGCCTCCGGGGCGTCCTGGGGCGGGAAGGTCACCAGCATGTTCAG 

18541 + + + + + + 18600 

CAGGTCCGCGGGCTACCGGAGGCCCCGCAGGACCCCGCCCTTCCAGTGGTCGTACAAGTC 
17 DLRGIAEPADQPPFTVLMNL- 

GTGGACGCCGGCCACGTGCTCGGGGTCGGCCAGCCCCAGCTCCAGCGAGACGACCTTTCC 

18601 + + + + + + 18660 

CACCTGCGGCCGGTGCACGAGCCCCAGCCGGTCGGGGTCGAGGTCGCTCTGCTGGAAAGG 
17 HVGAVHE PDALGLELSVVKG- 

CCAGTCGCCGCCCTGGGCGACGTAACGCTCGTAGCCGAGGCGGTTCATCAGCTCCGCCCA 

18661 + + + + + + 18720 

GGTCAGCGGCGGGACCCGCTGCATTGCGAGCATCGGCTCCGCCAAGTAGTCGAGGCGGGT 
17 WDGGQAVYREYGLRNMLEAW- 

GGCGCGTGCGATCCGCCGCACGTCCCAGCCCGGCTCGGCAGTCGGGCCGGAGAAGCCGTA 

18721 + + + + + + 18780 

CCGCGCACGCTAGGCGGCGTGCAGGGTCGGGCCGAGCCGTCAGCCCGGCCTCTTCGGCAT 
17 ARAI RRVDWGPEAT PG S F G Y - 

GCCCGGCATGGAGGGGACGACGACGTGGAAGGCGTCCGCCGGGTCGCCGCCGTGCGCGCG 

18781 + + + + + + 18840 

CGGGCCGTACCTCCCCTGCTGCTGCACCTTCCGCAGGCGGCCCAGCGGCGGCACGCGCGC 
17 GPMS PVVVHFADAP DGGHAR- 

CGGGTCGCTCAGCGGCCCGATGACGTCGAGGAACTCGGCGACCGAGCCCGGCCAGCCGTG 

18841 + + + + + + 18900 

GCCCAGCGAGTCGCCGGGCTACTGCAGCTCCTTGAGCCGCTGGCTCGGGCCGGTCGGCAC 
17 PDSLPGIVDLFEAVSGPWGH- 

GGTGAGGATCAGCGGGATCGCGTCCGGCTCGGGCGAACGCACGTGAAGGAAGTGCACGTC 

18901 + + + + + + 18960 

CCACTCCTAGTCGCCCTAGCGCAGGCCGAGCCCGCTTGCGTGCACTTCCTTCACGTGCAG 
17 TLILPIADPEPSRVHLFHVD- 

GGCGCCGTCGATCGTGGTGACGAACTGGGGGAACGCGTTCAGCTCGGCCTCCGCGGCACG 

18961 + + + + + + 19020 

CCGCGGCAGCTAGCACCACTGCTTGACCCCCTTGCGCAAGTCGAGCCGGAGGCGCCGTGC 
17 AGD I TTVFQ P FANLEAEAAR- 

CCAGTCGTAGCCGTGGCGCCAGTGGTCGGTGAGCTCCTTGAGGTAGGACAGCGGCACTCC 

19021 + + + + + + 19080 

GGTCAGCATCGGCACCGCGGTCACCAGCCACTCGAGGAACTCCATCCTGTCGCCGTGAGG 
17 WDYGHRWHDTLEKLYSLPVG- 

GCGGTCCCATCCGGATCCGGGTATCTCGGACGGCCACCGGGTCGCGTCGATCCGCCGGGT 

19081 + + + + + + 19140 

CGCCAGGGTAGGCCTAGGCCCATAGAGCCTGCCGGTGGCCCAGCGCAGCTAGGCGGCCCA 
17 RDWGSGPIESPWRTADIRRT- 

TAAGGTCGTCGAATGTCGGACTGGGTCGATCTCGATACGGAAGGGACGCACAGTGAATCC 
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19 141 + + + + + + 19200 

ATTCCAGCAGCTTACAGCCTGACCCAGCTAGAGCTATGCCTTCCCTGCGTGTCACTTAGG 

17_< LTTSHRVPDI E IRFPRM- 

ACCCTCGTGATTGTGGGAGCGGGGCGGCGCGAGGCGGCCGCCCCGATGTGATCCGGGGAC 

192 oi + + + + + + 19260 

TGGGAGCACTAACACCCTCGCCCCGCCGCGCTCCGCCGGCGGGGCTACACTAGGCCCCTG 

CGTGTCTCAGGCCGGTTCGGCCGGCGCGGCCGCGCCTTCCCGTGCGGAGAAGGACCGCAC 

19 261 + + + + + - + 19320 

GCACAGAGTCCGGCCAAGCCGGCCGCGCCGGCGCGGAAGGGCACGCCTCTTCCTGGCGTG 

16-* *APEAPAAAGERAS F S R V - 

GGAGGACAGGAAGTTGCGGATCATCGGCATGCCGTGTTCGGTCCGGAAGCTCTCCGGATG 

1932 i + + + + + + 19380 

CCTCCTGTCCTTCAACGCCTAGTAGCCGTACGGCACAAGCCAGGCCTTCGAGAGGCCTAC 
16 SSLFNRIMPMGHETRFSEPH- 

GAACTGGACGGACTCCACCGGCAGCGAACGGTGGCGCAGGCCCATCACGTACCCGTCGTC 

19381 + + + + + + 19440 

CTTGACCTGCCTGAGGTGGCCGTCGCTTGCCACCGCGTCCGGGTAGTGCATGGGCAGCAG 

16 FQVSEVPLSRHR LGMVYGDD- 

CGTGGAGCGCCCGGTGACCTCGAGGGACGGCGGGACCGTGCCCTCCGGCACGATCAGTGA 

19441 + + + + + + 19500 

GCACCTCGCGGGCCACTGGAGCTCCCTGCCGCCCTGGCACGGGAGGCCGTGCTAGTCACT 

16 TSRGTVELSPPVTGEPVILS- 

GTGGTAGCGGGTCGCGAAGAACCCCGCGGGCAGCCCGGTGAACACTCCGCGCCCGTCGTG 

19 50i + + --- + + + + 195 60 

CACCATCGCCCAGCGCTTCTTGGGGCGCCCGTCGGGCCACTTGTGAGGCGCGGGCAGCAC 
16 HYRTAFFGAPLGTFVGRGDH- 

CGTGATCCGGCTCGTCTTCCCGTGCATGAGATGCCGGGCGGGGACGGTGGCGGCGCCGTA 

19561 + + + + + + 19620 

GCACTAGGCCGAGCAGAAGGGCACGTACTCTACGGCCCGCCCCTGCCACCGCCGCGGCAT 
16 TIRSTKGHMLHRAPVTAAGY- 

GGCGCGGGCGACGGCCTGATGCCCCAGACAGACCCCGAGCAGCGGGACCCGGCCGGCGAA 

1962 i + + + + + + 19680 

CCGCGCCCGCTGCCGGACTACGGGGTCTGTCTGGGGCTCGTCGCCCTGGGCCGGCCGCTT 
16 ARAVAQHGL CVGL L PVRGAF- 

GGCCTGGACGATCTCGACGTGCCCGGAGGTGTCGGGGTGGCCGGGGCCCGGCCCCAGCAG 

1968 1 + + + + + + 19740 

CCGGACCTGCTAGAGCTGCACGGGCCTCCACAGCCCCACCGGCCCCGGGCCGGGGTCGTC 

16 AQVI EVHGSTDPHGPGPGLL- 

GACCGCGTCCGGCCGCATCAGCCCCATCTCGTCCGGGGTCATGAGATGCGACCGCACCAT 

19741 + + + + + + 19800 

CTGGCGCAGGCCGGCGTAGTCGGGGTAGAGCAGGCCCCAGTACTCTACGCTGGCGTGGTA 

16 VAD PRMLGMEDPTMLHS RVM- 

GACGGGCTCCGCGCCGGCGGACATCAGATACTGGCGCAGGATGTCGACGAAGCTGTCGAA 

19801 + + + + + + 19860 

CTGCCCGAGGCGCGGCCGCCTGTAGTCTATGACCGCGTCCTACAGCTGCTTCGACAGCTT 

16 VPEAGASMLYQRL I DVFSDF- 

CGCGTCGACCACCAGGACCCGCGGGGCCTCGGTGCCTGCGCCGGATCCGTCGGGAGACCA 

1 986 1 + + + + + + 19920 

GCGCAGCTGGTGGTCCTGGGCGCCCCGGAGCCACGGACGCGGCCTAGGCAGCCCTCTGGT 
16 ADVVLVRPAETGAGSGDPSW- 



CAAGCTCACAGCAACTCCTCTCCGGTGACCGCCCAGTGAGTGGCGCTCATCTTGGCCAGC 

+ + + + + + 

GTTCGAGTGTCGTTGAGGAGAGGCCACTGGCGGGTCACTCACCGCGAGTAGAACCGGTCG 
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16-< L S M - 

15-* * LLEEGTVAWHTASMKAL 

GTCTCGGTCCACTCCGCCCCCGGTTCGGAATCGGCGACGATTCCGGCCGAGGCCCGGGTG 

19981 + + + + + + 20040 

CAGAGCCAGGTGAGGCGGGGGCCAAGCCTTAGCCGCTGCTAAGGCCGGCTCCGGGCCCAC 
15 TETWEAGPESDAVIGASART- 

CGGTAGACGCCCTCGTGGTGGAAAAGGGTCCGGATGCACAGCGCGAGGTTGGTGTACCCG 

20041 + + + + + + 20100 

GCCATCTGCGGGAGCACCACCTTTTCCCAGGCCTACGTGTCGCGCTCCAACCACATGGGC 
15 RYVGEHHFLTRI CLALNTYG 

CCCACGTCGAGGAGGCCGAGCGCCCCGGCGTACAGGCCGCGGCGGCTGCGTTCGACGGAC 

20101 + + + + + -- + 20160 

GGGTGCAGCTCCTCCGGCTCGCGGGGCCGCATGTCCGGCGCCGCCGACGCAAGCTGCCTG 
15 GVDLLGLAGAYLGRRSREVS- 

TCGATGATCTCCATGGCGCGGATCTTCGGCGCGCCCGTCATGGTGCCGGCGGGGAACAGG 

20161 + + + + + + 20220 

AGCTACTAGAGGTACCGCGCCTAGAAGCCGCGCGGGCAGTACCACGGCCGCCCCTTGTCC 
15 E I I EMARI KPAGT MTGAPFL 

GCGGCGATGGTGTCGAAGGCATCGGTGTCCACCCGCGCCCGGCCGACGACCGTGGAGACC 

20221 + + + + + + 20280 

CGCCGCTACCACAGCTTCCGTAGCCACAGGTGGGCGCGGGCCGGCTGCTGGCACCTCTGG 
15 AA I TD FADTDVRARGVVT S V 

AGGTGCAGCACGTGGGAGTAGCCCTCCACGTCCAGCTGGTCGGGTACGTCGAGCGTGTTC 

20281 + + + + + + 20340 

TCCACGTCGTGCACCCTCATCGGGAGGTGCAGGTCGACCAGCCCATGCAGCTCGCACAAG 
15 LHLVHSYGEVDLQDPVDLTN- 

GGCCGGGCGATCCGTCCGATGTCGTTGCGGCAGAGGTCCACCAGCATGGTGTGCTCGGCG 

20341 + + + + + + 20400 

CCGGCCCGCTAGGCAGGCTACAGCAACGCCGTCTCCAGGTGGTCGTACCACACGAGCCGC 
15 PRAI RGIDNRCLDVLMTHEA- 

ATCTCCTTGGGATCCGACCTCAGCCGGACTCCCGCGGCGATGCCGCCGTCCGCGCCGGAC 

20401 + + + + + + 20460 

TAGAGGAACCCTAGGCTGGAGTCGGCCTGAGGGCGCCGCTACGGCGGCAGGCGCGGCCTG 
15 IEKPDSRLRVGAAIGGDAGS- 

CGCGGCACCGTGCCCGCGATCGGCCGCATCGTGACCTCGCCGTCCTCGATGCGTACGAAC 

20461 + + + + + + 20520 

GCGCCGTGGCACGGGCGCTAGCCGGCGTAGCACTGGAGCGGCAGGAGCTACGCATGCTTG 
15 RPVTGAI PRMTVEGDEIRVF- 

AGCTCGGGGCTGGCGCCGATCAGACGGTGCCCGTCGATGCCCGCCAGATACATGTACGGG 

20521 + + + + + + 20580 

TCGAGCCCCGACCGCGGCTAGTCTGCCACGGGCAGCTACGGGCGGTCTATGTACATGCCC 
15 LEPSAGILRHGDI GALYMYP 

GAGGCGTTCCGCCCGCGCAGGCGCTGGTAGACGTCCGCGGGGTCGGCCGTCGAGCGGATG 

20581 + + + + + + 20640 

CTCCGCAAGGCGGGCGCGTCCGCGACCATCTGCAGGCGCCCCAGCCGGCAGCTCGCCTAC 
15 SANRGRLRQYVDAPDATSRI 

GAGAGCTCGTGACCGATCTGCACCTGGTAGATGTCGCCGACGGCGATGTGCTTCAGACAC 

20641 + + + + + + 20700 

CTCTCGAGCACTGGCTAGACGTGGACCATCTACAGCGGCTGCCGCTACACGAAGTCTGTG 
15 SLEHGIQVQYIDGVAIHKLC- 

CGCTCGACGTCGTTCGCGAACACTTCGGGGGCGCTGTCGTCGGTGACCGCGGAGGCGGGG 

20701 + + + + + + 20760 

GCGAGCTGCAGCAAGCGCTTGTGAAGCCCCCGCGACAGCAGCCACTGGCGCCTCCGCCCC 
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15 REVDNAFVEPASDDTVASAP 

AAGCCGTCTGCGGACGGATCGGGCCAGGCCTGCTCCACGTCGGCGAGGAGCCCGGTGACG 

20761 + + + + + + 20820 

TTCGGCAGACGCGTGCCTAGCCCGGTCCGGACGAGGTGCAGCCGCTCCTCGGGCCACTGC 
15 FGDAS PD PWAQEVDALLGTV- 

GTCTCCGGCGCGAGGCCGGGCCAGTACGGGGACTCGTGGAGCAGCAGTTCGCATCGGCCG 

20821 + + + + + + 20880 

CAGAGGCCGCGCTCCGGCCCGGTCATGCCCCTGAGCACCTCGTCGTCAAGCGTAGCCGGC 
15 TEPALGPWYPSEHLLLECRG- 

GTGGCGAGATCGGTGACCACGCTGCCCCGGTGCAGGACCATGCGTACGTCCGGCAGGCCA 

20881 + + + + + + 20940 

CACCGCTCTAGCCACTGGTGCGACGGGGCCACGTCCTGGTACGCATGCAGGCCGTCCGGT 
15 TALDTVVSGRHLVMRVDP LG- 

GGCCGGTTCTCGATGAGGTGGGGCAGGTCCTCGATGTAGCGGGCCGTGTCGTACCCGAAG 

20941 + + + + + + 21000 

CCGGCCAAGAGCTACTCCACCCCGTCCAGGAGCTACATCGCCCGGCACAGCATGGGCTTC 
15 PRNEILHPLDEIYRATDYGF- 

AACCCGAGGAACCCGAAGCGGAAGCCGGACGCGGACCCCTCGGCGTCGAACATGTCCCGC 

21001 + + + + + + 21060 

TTGGGCTCCTTGGGCTTCGCCTTCGGCCTGCGCCTGGGGAGCCGCAGCTTGTACAGGGCG 
15 FGLFGFRFGSASGEADFMDR- 

ATGGCCCGCAGCAGCGGCCACAACCCGCCCGCGGTACGCAGCCGCAGCCCCTGGGGGCCG 

21061 + + + + + + 21120 

TACCGGGCGTCGTCGCCGGTGTTGGGCGGGCGCCATGCGTCGGCGTCGGGGACCCCCGGC 
15 MARLLPWLGGATRLRLGQPG- 

TCCTCCAGGAGCGCGCCGGCCCGCTCCAGGAGCAGGCCCCGCAGGGCGGGTACGCCCTCG 

21121 + + + + + + 21180 

AGGAGGTCCTCGCGCGGCCGGGCGAGGTCCTCGTCCGGGGCGTCCCGCCCATGCGGGAGC 
15 DELLAGARELLLGRLAPV GE- 

ACGCGCACCACCCGGTCGGTGACCGAGAGCGAGAGCAGCGCGCCGAAGCCGACGAACTGG 

21181 + + + + + + 21240 

TGCGCGTGGTGGGCCAGCCACTGGCTCTCGCTCTCGTCGCGCGGCTTCGGCTGCTTGACC 
15 VRVVRDTVSLSLLAGFGVFQ- 

TGCCTGCGGTCGCGGGCCGGGCCGGCCGCGGACTCCAGGAGGTAGACCTCGTCGGGGCCG 

21241 + + + - + + + 21300 

ACGGACGCCAGCGCCCGGCCCGGCCGGCGCCTGAGGTCCTCCATCTGGAGCAGCCCCGGC 
15 HRRDRAPGAASELLYVEDPG 

AAGTGCTCGGCCAGCGCGCGGTAGGCGGGCAGGGCGCCCGTCTCCTTCACATCGAGGCGT 

21301 + + + + + + 21360 

TTCACGAGCCGGTCGCGCGCCATCCGCCCGTCCCGCGGGCAGAGGAAGTGTAGCTCCGCA 
15 FHEALARYAPLAGTEKVDLR- 

CGTGTCCGCACCCGCACCGGGGCCGAGACCACGCACTGGTCGGTCATCCTGGGTCCTCCC 

21361 + + + + + + 21420 

GCACAGGCGTGGGCGTGGCCCCGGCTCTGGTGCGTGACCAGCCAGTAGGACCCAGGAGGG 
15-< RTRVRV PASVVCQDTM- 

GGATCACGTGGTGATGGCGTAGCGGTGTGCCACCTGACGGGCGGTCAGCACCGCCCGGTC 

21421 + + + + + + 21480 

CCTAGTGCACCACTACCGCATCGCCACACGGTGGACTGCCCGCCAGTCGTGGCGGGCCAG 
14-* * TT I AYRHAVQRATLVARD- 

GGGGCCGGAGCGGTTGTCGACGACGCGCGCGGCCTTCCAGCTGACGAAGGAGCCGGTGTG 

21481 + + + + + + 21540 

CCCCGGCCTCGCCAACAGCTGCTGCGCGCGCCGGAAGGTCGACTGCTTCCTCGGCCACAC 
14 PGSRNDVVRAAKWSVF SGTH- 
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GGTCACGGGGTCGAGGTCGGTGTCCACGACGATGCCGGCGTGCGCGCCGGTCCGCTCCCT 

21541 + + + + + + 21600 

CCAGTGCCCCAGCTCCAGCCACAGGTGCTGCTACGGCCGCACGCGCGGCCAGGCGAGGGA 
14 TVPDLDTDVVI GAHAGTRER- 

GAGCCGGGCGGCGACGGCCTCGCCGATGCCCTGCCGTTCCCCCTCGGCGCCGGCCAGCAG 

21601 + + + + + + 21660 

CTCGGCCCGCCGCTGCCGGAGCGGCTACGGGACGGCAAGGGGGAGCCGCGGCCGGTCGTC 
14 LRAAVAEG I GQREGEAGALL- 

GTCCATGCGCACGGTGACGGCGTCGCTGCCGTCGTCCTGCCGGTCGATGACGACCTGGTA 

21661 + + + + + + 21720 

CAGGTACGCGTGCCACTGCCGCAGCGACGGCAGCAGGACGGCCAGCTACTGCTGGACCAT 
14 DMRVTVADSGDDQRD I V V Q Y - 

GCCGAGGCAGCCGCCGACCCCGTCGAGGATCGCGGCCTCCAGCTCGGCGGGCTGGAGGGT 

21721 + + + + + + 21780 

CGGCTCCGTCGGCGGCTGGGGCAGCTCCTAGCGCCGGAGGTCGAGCCGCCCGACCTCCCA 
14 GLCGGVGDLIAAELEAP.QLT- 

CACGTCGCCCAGGGGGATGCGGTCCGCGACCCGGCCGATGACCTGGATCCGCGGTCCCGG 

21781 + + + + + + 21840 

GTGCAGCGGGTCCCCCTACGCCAGGCGCTGGGCCGGCTACTGGACCTAGGCGCCAGGGCC 
14 VDGLP IRDAVRGIVQ I RPGP- 

CAGCGGCTCCCCGGGGCCCGCCGGGAGGATGCGGACCAGGTCCCCGGTGCGGTAGCGGAT 

21841 + + + + + + 21900 

GTCGCCGAGGGGCCCCGGGCGGCCCTCCTACGCCTGGTCCAGGGGCCACGCCATCGCCTA 
14 LPEGPGAPLIRVL DGTRYRI- 

CAGTGGTTTGATGCCGTCCACCAGCATGGTGAGGACGAGTTCGCCCTCTCCCGTGTCGCC 

21901 + + + + + + 21960 

GTCACCAAACTACGGCAGGTGGTCGTACCACTCCTGCTCAAGCGGGAGAGGGCACAGCGG 
14 LPKIGDVLMTLVLEGEGTDG- 

GACCACGGCGCCGGTGTCCGGTTCGACGAGTTCGGTCAAGTAGTTGGGCTGGGCGAGGTG 

21961 + + + + + + 22020 

CTGGTGCCGCGGCCACAGGCCAAGCTGCTCAAGCCAGTTCATCAACCCGACCCGCTCCAC 
14 VVAGTDPEVLETLYNPQALH- 

GAGCGCTCCGGTGTCCGCTCCGGTGGCGATGCACAGGGCTTCCTGGGAGCCGTAGAGCGT 

22021 + + + + + + 22080 

CTCGCGAGGCCACAGGCGAGGCCACCGCTACGTGTCCCGAAGGACCCTCGGCATCTCGCA 
14 LAGTDAGTAI CLAEQSGYLT- 

GGGCCGCACGACGGCTTGCGGCCAGAGGGTCGCCACGTTGTCGGCGAACTGCGGGGTGCA 

22081 + + + + + + 22140 

CCCGGCGTGCTGCCGAACGCCGGTCTCCCAGCGGTGCAACAGCCGCTTGACGCCCCACGT 
14 PRVVAQPWLTAVNDAFQPTC- 

GATCTCACCCAGCGTGAGGAAGAGCTTCACGGGAAGCCGGGCCAGGTCGTAGCCGTAGTG 

22141 + + + + + + 22200 

CTAGAGTGGGTCGCACTCCTTCTCGAAGTGCCCTTCGGCCCGGTCCAGCATCGGCATCAC 
14 IEGLTLFLKVPLRALDYGYH- 

CAGGGCCGCCTTGGCAAGGCTCAGGCACAGCGCCGGAGCACAGACGACGACCTCGACCTC 

22201 + + + + + + 22260 

GTCCCGGCGGAACCGTTCCGAGTCCGTGTCGCGGCCTCGTGTCTGCTGCTGGAGCTGGAG 
14 LAAKALSLCLAPACVVVEVE- 

CAGCTCCTCGATCAGCCGCAGCGCCTTACGGAATCCCACCCTGGGGGACTCGGGCCAGAT 

22261 + + + + + + 22320 

GTCGAGGAGCTAGTCGGCGTCGCGGAATGCCTTAGGGTGGGACCCCCTGAGCCCGGTCTA 
14 LEE ILRLAKRFGVRPSEPWI- 
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CTTGACGTGACAGGCCCCCAGCTCCGCTGCCACCGCGGTGAACACGTCCCCGAACGCGTA 

22321 + + + + + + 22380 

GAACTGCACTGTCCGGGGGTCGAGGCGACGGTI3GCGCCACTTGTGCAGGGGCTTGCGCAT 
14 KVHCAGL EAAVAT FVDGFAY- 

CAGCTCCGACGGCCCCATCAGGCCCACGACGGGCATCCGCCCCCCGAACCTCGCTTCCAG 

22381 + + + + + + 22440 

GTCGAGGCTGCCGGGGTAGTCCGGGTGCTGCCCGTAGGCGGGGGGCTTGGAGCGAAGGTC 
14 LESPGMLGVVPMRGGFRAEL- 

CATGCGGCGCCAGGACTCCCGGACGGCGATGTTGCTGGTCGCGATGTCCTTCTCGCCGCG 

22441 + + + + + + 22500 

GTACGCCGCGGTCCTGAGGGCCTGCCGCTACAACGACCAGCGCTACAGGAAGAGCGGCGC 
14 MRRWSERVAINSTAIDKEGR- 

TGGGCACGGGGTGGCCGCCCCGGTGGTCCCGGTGGTCTCGTAGTAGATGCGTGCTTCGTG 

22501 + + + + + + 22560 

ACCCGTGCCCCACCGGCGGGGCCACCAGGGCCACCAGAGCATCATCTACGCACGAAGCAC 
14 PC PTAAGTTGTTEYYI RAEH- 

CAGCGGGCCCGACAGGACGTCGTGCATCTCCCGCCGCAGGTCGTCCTTGGTGGTGAAGGG 

22561 + + + + + + 22620 

GTCGCCCGGGCTGTCCTGCAGCACGTAGAGGGCGGCGTCCAGCAGGAACCACCACTTCCC 
14 LPGSLVDHMERRLDDKTTFP- 

CAGGTCCGCCAGGTTCGCGGGGGTGACGGCCTCGACGTCCACGCCTGCCAGATGGCGGCG 

22621 + + + + + + 22680 

GTCCAGGCGGTCCAAGCGCCCCCACTGCCGGAGCTGCAGGTGCGGACGGTCTACCGCCGC 
14 LD ALNA P TVAE VD VGALH RR- 

GTAGAACGGCGAGCGGCGGGTGACGTGGCGCAGTACGGCCGTCAGCCGTTCGCCCTCCCA 

22681 + + + + + + 22740 

CATCTTGCCGCTCGCCGCCCACTGCACCGCGTCATGCCGGCAGTCGGCAAGCGGGAGGGT 
14 YFPSRRTVHRLVATLREGEW- 

GCGCTCGCGGTCGGCGGCGGTGAGTTCGCCGCGGTAGAACGCGTCGCTCACCTGCCCGTA 

22741 + + + + + + 22800 

CGCGAGCGCCAGCCGCCGCCACTCAAGCGGCGCCATCTTGCGCAGCGAGTGGACGGGCAT 
14 RERDAATLEGRYFADSVQGY- 

GGCGGACCAGAACTCGCTGTCCGCGTCGGGGTCCAGCGGCCCGGTCCCGCCGGGACCGGG 

22801 + + + + + + 22860 

CCGCCTGGTCTTGAGCGACAGGCGCAGCCCCAGGTCGCCGGGCCAGGGCGGCCCTGGCCC 
14 ASWFESDADPDLPGTGGPGP- 

CCGCCGGCCGTCTCTCACGGCTGTGCCTGGAGTTCGTTGAGCGCGAGGCCGACCCGCTCG 

22861 + + + + + + 22920 

GGCGGCCGGCAGAGAGTGCCGACACGGACCTCAAGCAACTCGCGCTCCGGGTGGGCGAGC 
14-< R R G D R M - 

21-* * PQAQLENLALGVRE - 

TTGACCTCGTTGGAGGCCAGCACGTCCGAACGGCCGGTGAGCCGACGGTGTTCGTCGAGC 

22921 + + + + + + 22980 

AACTGGAGCAACCTCCGGTCGTGCAGGCTTGCCGGCCACTCGGCTGCCACAAGCAGCTCG 
21 NVENSALVDSRGTLRRHEDL- 

AGTTCGATCATGTCCGTCATCCTCTCGACCAGGCGCGAGACGTTGGTGAGGCCCTCCTCG 

22981 + + + + + + 23040 

TCAAGCTAGTACAGGCAGTAGGAGAGCTGGTC CGCGCTCTGCAAC CACTCCGGGAGGAGC 
21 LEIMDTMREVLRSVNTLGEE 

TCCTTGAGCGCGTCGCCCCGGTGCAGCGCGTGCACCGTCGCCGGGAAGCCGCTGCCCACC 

23041 + + + + + + 23100 

AGGAACTCGCGCAGCGGGGCCACGTCGCGCACGTGGCAGCGGCCCTTCGGCGACGGGTGG 
21 DKLADGRHLAHVTAPFGSGV- 
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AGGATCATCCGGTTGAGCAGGGCATTGACGGTCAGCTGAGCCCATACCTCGCCGGCGCTG 

23101 + + + + + + 23160 

TCCTAGTAGGCCAACTCGTCCCGTAACTGCCAGTCGACTCGGGTATGGAGCGGCCGCGAC 
21 LIMRNLLANVTLQAWVEGAS 

TAGCGGCGGGCGACCGAGATGATCCCCGCGACCTTGTTGCTCAGCGGCCGGTCGAAGCGC 

23161 + + + + + + 23220 

ATCGCCGCCCGCTGGCTCTACTAGGGGCGCTGGAACAACGAGTCGCCGGCCAGCTTCGCG 
21 YRRAVS I IGAVKNSLPRDFR- 

AGATAACCGACTCCGGCACGCTCGATGAAGGTCTGCATGAGGCTGGCCGTGCCGAATCCG 

23221 + + + + + + 23280 

TCTATTGGCTGAGGCCGTGCGAGCTACTTCCAGACGTACTCCGACCGGCACGGCTTAGGC 
21 LYGVGA RE I FTQMLSATGFG- 

TGCACGGGCGCCGCGAAGATGATCCCGTCCGCCGCGACCATCTTCGCCACGACCTCGGGC 

23281 + + + + + + 23340 

ACGTGCCCGCGGCGCTTCTACTAGGGCAGGCGGCGCTGGTAGAAGCGGTGCTGGAGCCCG 
21 HVPAAF I IGDAAVMKAVVEP 

ACCCCGTCGGCCAGGGTGCAGGCCACCGGCCTGTCGTTGCAGTCCCCGCAGGGCCCGCAC 

23341 + + + + + + 23400 

21 VGDALTCAVPRDNCDGCPGC- 



CGCTCCATCCTGATCGAGCGCAGGTCGACGGCCTCGAAGTCGACGCCGCGGTTCTCTGCT 

23401 + + + + + + 23460 

GCGAGGTAGGACTAGCTCGCGTCCAGCTGCCGGAGCTTCAGCTGCGGCGCCAAGAGACGA 
21 REM R I S R LDVAE FDVG RNEA 

ACGCGTGCCGCGTGCCGCAGTACGTCGGCGGTGTTGCCGTCACGTTCCGAACCGTTGATC 

23461 + + + + + + 23520 

TGCGCACGGCGCACGGCGTCATGCAGCCGCCACAACGGCAGTGCAAGGCTTGGCAACTAG 
21 VRAAHRLVDATNGDRESGNI- 



GCGAGGATCTTGAGTTGTGCGCTCACGAGGGGCCTCCTTGGTGAGTCAGGTGCGCTCGGC 
23521 + + + + + + 23580 

CGCTCCTAGAACTCAACACGCGAGTGCTCCCCGGAGGAACCACTCAGTCCACGCGAGCCG 
13-* * T R E A - 

21-< ALIKLQASM- 

GGTCGGCTCGGGGGAACTGTCTGGCCGCCGCTGGTCCGGGAGCCGCAGGGCCGGCTCGGC 

23581 + + + + + + 23640 

CCAGCCGAGCCCCCTTGACAGACCGGCGGCGACCAGGCCCTCGGCGTCCCGGCCGAGCCG 
13 TPEPS SDPRRQDPLRLAPEA- 

GGGGGCGGGAGGAAGACCGCCCCGCGGCGGGCCGCCACGCTCGCCGAACCGGATGAGGGG 

23641 + + + + + + 23700 

CCCCCGCCCTCCTTCTGGCGGGGCGCCGCCCGGCGGTGCGAGCGGCTTGGCCTACTCCCC 
13 PAPPLGGRPPGGREGFRILP- 

CTTCTCGACGAGATAGAAGCTGATGGTCGCCAGCACGACGCTGATCGAGATCGTGAAGAG 

23701 + + + + + + 23760 

GAAGAGCTGCTCTATCTTCGACTACCAGCGGTCGTGCTGCGACTAGCTCTAGCACTTCTC 
13 KEVLYFS ITALVVS I S ITFL- 

GAACAGTTCCCAGAACCCCATGTCACCCCGGAATTCCGGCGTTGGCACGGGAGACTTGCC 

23761 + + + + + + 23820 

CTTGTCAAGGGTCTTGGGGTACAGTGGGGCCTTAAGGCCGCAACCGTGCCCTCTGAACGG 
13 FLEWFGMDGRFE PTPVPSKG- 



GAAGATGCTGCCGTTCCTGAGCCAGAGGTTGATCACGATCTCGTGCCAGAGGTAGACGCC 

23821 + + + + 4- + 23880 

CTTCTACGACGGCAAGGACTCGGTCTCCAACTAGTGCTAGAGCACGGTCTCCATCTGCGG 
13 FI SGNRLWLNIVI EHWLYVG- 



GAGGGAGATCTGGCCGAGGAAGAGGATCGGCTTGCTGGTGAAGAGCGCGTCCGAGAACCG 
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23881 + + + + + + 23940 

CTCCCTCTAGACCGGCTCCTTCTCCTAGCCGAACGACCACTTCTCGCGCAGGCTCTTGGC 
13 LSIQGLFLIPKSTFLADSFR- 

GGACTCGGCGCCGGGGACCGTCATCGGTGCCAGGAGCAGCAGGGTGAAGGAGGTCAGGAT 

23941 + + + + + + 24000 

CCTGAGCCGCGGCCCCTGGCAGTAGCCACGGTCCTCGTCGTCCCACTTCCTCCAGTCCTA 
13 SEAGPVTMPALLLLTF STLI- 

GAAGTGGTCGACGAGCTCCTGGGCCAGGGCCGCGTTGTCGCCCATGCCCGGGATGCCGAT 

24001 + + - + + + + 24060 

CTTCACCAGCTGCTCGAGGACCCGGTCCCGGCGCAACAGCGGGTACGGGCCCTACGGCTA 
13 FHDVLEQALAANDGMG P I G I - 

GGGCTTGGTGGCGTAGAGGAGGTACAGCGGGATGAGCGGGACCCAGCAGATCAGCGGGCG 

24061 + + + + + + 24120 

CCCGAACCACCGCATCTCCTCCATGTCGCCCTACTCGCCCTGGGTCGTCTAGTCGCCCGC 
13 PKTAYLLYLPILPVWC I L P R - 

CCGGATCACGAAACGGTAGAAGCCCGGGGTCCCTGGCGTCGCCTCGGCGTACGCGGAGTA 

24121 + + + + + + 24180 

GGCCTAGTGCTTTGCCATCTTCGGGCCCCAGGGACCGCAGCGGAGCCGCATGCGCCTCAT 
13 RIVFRYFGPTGPTAEAYASY- 

GATGGCCAGTGCCATGCCCGCGGCGAAGCAGCCGGCGTAGTAGGGCGGCCAGTACCACTG 

24181 + + + + + + 24240 

CTACCGGTCACGGTACGGGCGCCGCTTCGTCGGCCGC7VTCATCCCGCCGGTCATGGTGAC 
13 IALAMGAAFCGAYY P PWYWQ- 

CATCGTCGCGCCGGTGGAGGGGAGGTTGGTGTACGTGACCCAGCCGATGGCCATGACTTC 

24241 + + + + + + 24300 

GTAGCAGCGCGGCCACCTCCCCTCCAACCACATGCACTGGGTCGGCTACCGGTACTGAAG 
13 MTAGTS PLNTYTVWG I A M V E - 

CAGCGCGGCCAGCGGCAGCAGGAGGCGGCGTGCCTTCTGCCCGGGAGTGCTGCCGCCCCG 

24301 + + + + + + 24360 

GTCGCGCCGGTCGCCGTCGTCCTCCGCCGCACGGAAGACGGGCCCTCACGACGGCGGGGC 
13 LAALPLLLRRAKQGPTSGGR- 

CGCGAGCCGGTGGCCGATCCAGGCGATCAGCGGCAGGGCGAGGTAGAACGTGAACTCGGC 

24361 + + + + + + 24420 

GCGCTCGGCCACCGGCTAGGTCCGCTAGTCGCCGTCCCGCTCCATCTTGCACTTGAGCCG 
13 ALRHGIWAI LPLALYFTFEA- 

GGGGACCGTCCAGGTGGGCTCGATGCCGTGCATCGGCTGGCCCTCGGGCAGATAGAAGTG 

24421 + + + + + + 24480 

CCCCTGGCAGGTCCACCCGAGCTACGGCACGTAGCCGACCGGGAGCCCGTCTATCTTCAC 
13 PVTWTPE I GHMPQGE PLYFH- 

CATGAGCAGCACGGGCCGCAGGACGTCGCTGACGCTGTCGATCTCGAACCAGTTGTAGCC 

24481 + + + + + + 24540 

GTACTCGTCGTGCCCGGCGTCCTGCAGCGACTGCGACAGCTAGAGCTTGGTCAACATCGG 
13 MLLVPRLVDSVSD I EFWNYG- 

GGGGATTGCGAAGACGAGCAACAGGTAGTAGGCGGGCAGGATGCGCAGGGCCCGGCGTTT 

24541 + + + + + + 24600 

CCCCTAACGCTTCTGCTCGTTGTCCATCATCCGCCCGTCCTACGCGTCCCGGGCCGCAAA 
13 PIAFVLLLYYAPL I RLARRK- 

GAGGAACCGTCCGGTGGCGGGCCGCTTCGTCCCACTGATGGTGACGCGGGCGTAGGGCTT 

24601 + + + + + + 24660 

CTCCTTGGCAGGCCACCGCCCGGCGAAGCAGGGTGACTACCACTGCGCCCGCATCCCGAA 
13 LFRGTAPRKTGS I TVRAYPK- 

GTACAGCATCATTCCGGACAGAGCGAAGAAGGGGGAAGGCATAC CCCCAGAC CGTCCGCG 
24661 + + + + + + 24720 
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CATGTCGTAGTAAGGCCTGTCTCGCTTCTTCCCCCTTCCGTATGGGGGTCTGGCAGGCGC 
l3-< YLMMGSLAFFPSPM- 

AGGACGCCCCAGAACGGTTTGCCCGGCTCACCGACGAAGCTGCCCACTCCGGCCTGGAAG 

24721 + + + + + + 24780 

TCCTGCGGGGTCTTGCCAAACGGGCCGAGTGGCTGCTTCGACGGGTGAGGCCGGACCTTC 

GCGACGTGGTAGACGACCACACCCAGCGCGAGGACACCTCGCAGTCCCTCGAACTTCGGT 

24781 + + + + + + 24840 

CGCTGCACCATCTGCTGGTGTGGGTCGCGCTCCTGTGGAGCGTCAGGGAGCTTGAAGCCA 

ATTCGCTTGCTTTTTGCGCCACCTGCGTCGCGAAGGACGTCCCCCATGGAACAGTCCCCT 

24841 + + + + + + 24900 

TAAGCGAACGAAAAACGCGGTGGACGCAGCGCTTCCTGCAGGGGGTACCTTGTCAGGGGA 

TTCCCTTGGCACTTGCTCGTTGACTTCCCGAAATAGTCGGGTCTGCGGAGTGTGAGCCGC 

24901 + + + + + + 24960 

AAGGGAACCGTGAACGAGCAACTGAAGGGCTTTATCAGCCCAGACGCCTCACACTCGGCG 

ATCTCCAATCGTGCTGTTCCGGTGCTCAGGACGACTTGTTTCGGCCTGAGTGGGAAGGCA 

24961 + + + + + + 25020 

TAGAGGTTAGCACGACAAGGCCACGAGTCCTGCTGAACAAAGCCGGACTCACCCTTCCGT 
12-* *SSKNRGSHSP 

GCCACCCCCGCCGCCCCGCCTCGGCCAGACCGGGGGCCGAGGAGTCCCGTTCCGAGAGGA 

25021 + + + + + + 25080 

CGGTGGGGGCGGCGGGGCGGAGCCGGTCTGGCCCCCGGCTCCTCAGGGCAAGGCTCTCCT 
12 LWGRRGAEAL.GPASSDRESL 

TCGGAGTGATCTCCGGCGGCCAGGCGATGCCCACCTCCGGATCCAGCGGATTCAAGCCAT 

25081 + + + + + + 25140 

AGCCTCACTAGAGGCCGCCGGTCCGCTACGGGTGGAGGCCTAGGTCGCCTAAGTTCGGTA 
12 IPTIEPPWAIGVEPDLPNLG 

GTTCGAGCCGGGGGTCGTAGGCCGCCGAGCACAGGTAGACGATCACCGCCTCGTCGCTCA 

25141 + + + + + + 25200 

CAAGCTCGGCCCGCAGCATCCGGCGGCTCGTGTCCATCTGCTAGTGGCGGAGCAGCGAGT 
12 HELRPDYAASCLYVIVAEDS 

GCGTGAGGAATCCGAAGCCCAGCCCCGCGGAGACGTACAGCGCCCGTCCGTTCTCCTCGC 

25201 + + + + + + 25260 

CGCACTCCTTAGGCTTCGGGTCGGGGCGCCTCTGCATGTCGCGGGCAGGCAAGAGGAGCG 
12 LTLFGFGLGASVYLARGNEE 

CGAGCTCCACGGTCCGCCAGCCGCCGAAGGTGGGCGACCCCACCCGGATGTCGACCACGG 

25261 -f + + + + + 25320 

GCTCGAGGTGCCAGGCGGTCGGCGGCTTCCACCCGCTGGGGTGGGCCTACAGCTGGTGCC 
12 GLEVTRWGGFTPSGVRIDVV 

CGCCGAACACGCTGCCGCGCAGGCAGCTGAAGTACTTGGCCTGGCCGGGTACGCCCCCGG 

25321 + + + + + + 25380 

GCGGCTTGTGCGACGGCGCGTCCGTCGACTTCATGAACCGGACCGGCCCATGCGGGGGCC 
12 AGFVSGRLCSFYKAQGPVGG 

CGAAGTGGATGCCCCGCAGCACCCCGTGGGAGGAGATCGCGCAGTTCGCCTGCCGCAGGT 

25381 + + + + + + 25440 

GCTTCACCTACGGGGCGTCGTGGGGCACCCTCCTCTAGCGCGTCAAGCGGACGGCGTCCA 
12 AFH IGRLVGHSS IACNAQRL 

CGAAGGAGTGGCCTACGGTGCGGCGGAAGGGCTCGCCCTGGAACCACTCGCGAAACGAGC 

25441 + + + + + + 25500 

GCTTCCTCACCGGATGCCACGCCGCCTTCCCGAGCGGGACCTTGGTGAGCGCTTTGCTCG 
12 DFSHGVTRRFPEGQFWERFS 

CCCGTTCGTCACGGAAGACCTGCTTCTCCTCCGTCCACGCTCCCGAGATCCCGATCGGCT 
25501 -- + + + + + + 25560 
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GGGCAAGCAGTGCCTTCTGGACGAAGAGGAGGCAGGTGCGAGGGCTCTAGGGCTAGCCGA 
12 GREDRFVQKEETWAGS I GI P 

TCATCGCTGGCCCCTTCTCTCGACTTCTCTCGACGACTCGCGGGAGGCGGCCGAGGGGTC 

25561 + + + + + + 25620 

AGTAGCGACCGGGGAAGAGAGCTGAAGAGAGCTGCTGAGCGCCCTCCGCCGGCTCCCCAG 
12-< K M - 

CGCCGGGCCCGTGGGAACGCCGCAGTCTAGATGCGGCGGCACCGGGGGCAGGGGGGTGCG 

25621 + + + + + + 25680 

GCGGCCCGGGCACCCTTGCGGCGTCAGATCTACGCCGCCGTGGCCCCCGTCCCCCCACGC 

GACGACGTCCGCCCCACCTCAGCACACCGGGAGATGCAGGTCGGTGACGGGCGACGTGAC 

25681 + + + + + + 25740 

CTGCTGCAGGCGGGGTGGAGTCGTGTGGCCCTCTACGTCCAGCCACTGCCCGCTGCACTG 

GATGCAACGGTCCGAGGCCCGGTTGCCCGGACGACGGCCCACAGAGCCATCGGAGCAACG 

25741 + + -- + + + + 25800 

CTACGTTGCCAGGCTCCGGGCCAACGGGCCTGCTGCCGGGTGTCTCGGTAGCCTCGTTGC 

GAGGCGGACCGCAGATGACCAAGCACGCCCGTGACCGCGCGGTAGTCCTCGGCGCAGGGA 

25801 + + + + + + 25860 

CTCCGCCTGGCGTCTACTGGTTCGTGCGGGCACTGGCGCGCCATCAGGAGCCGCGTCCCT 
20-> MT KHARDRAVVL GAGM- 

TGGCGGGGCTGCTCGCCGCGCGCGTCCTGTCCGAGACGTACAAGGAAGTGCTGGTGATCG 

25861 + + + + + + 25920 

ACCGCCCCGACGAGCGGCGCGCGCAGGACAGGCTCTGCATGTTCCTTCACGACCACTAGC 
20 AGLLAARVLSETYKEVLVID- 

ACCGGGACCGGTTGGGCGGCACGGAGCAGCGCCGCGGTGTCCCGCACGGACGCCACGCCC 

25921 + + + + + + 25980 

TGGCCCTGGCCAACCCGCCGTGCCTCGTCGCGGCGCCACAGGGCGTGCCTGCGGTGCGGG 
20 RDRLGGTEQRRGVPHGRHAH- 

ATGCGCTGCTGGCCAAGGGACAGCAGATCCTCAACGAACTCTTCCCCGGACTCGACACCG 

25981 + + + + + + 26040 

TACGCGACGACCGGTTCCCTGTCGTCTAGGAGTTGCTTGAGAAGGGGCCTGAGCTGTGGC 
20 ALLAKGQQILNELFPGLDTE- 

AACTCACCTCGGCCGGAATCCCCGCCGGGGACATCGCCGGGAACCTGCGGTGGTACTTCA 

26041 + + + + + + 26100 

TTGAGTGGAGCCGGCCTTAGGGGCGGCCCCTGTAGCGGCCCTTGGACGCCACCATGAAGT 
20 LTSAGI PAGDIAGNLRWYFN- 

ACGGCCGCCGGCTCCAGCCCTTCGACACCGGGCTGATCAGCGTCTCGGCGACGAGGCCCG 

26101 + + + + + + 26160 

TGCCGGCGGCCGAGGTCGGGAAGCTGTGGCCCGACTAGTCGCAGAGCCGCTGCTCCGGGC 
20 GRRLQPFDTGLI SVSATRPE- 

AGCTGGAGTCCCACGTGCGCGCACGGGTCGCCGCGCTGGCACAGGTGAAGATCATGGACG 

26161 + + + + + + 26220 

TCGACCTCAGGGTGCACGCGCGTGCCCAGCGGCGCGACGGTGTCCACTTCTAGTACCTGC 
20 LE S HVRARVAAL P QVK I MDG- 

GGTGCGTGATCCGGGGCCTGACCGCCTCGGCCGACCGCAGCCGCGTCACCGGTGTCGAGG 

26221 + + + + + + 26280 

CCACGCACTAGGCCCCGGACTGGCGGAGCCGGCTGGCGTCGGCGCAGTGGCCACAGCTCC 
20 CV I RGLTASADRSRVT GVEV- 

TGGTCGACGAGTCGGGTACGGACACCCCGACGCGCCTGGAGGCCGACCTCGTCGTCGACG 

26281 + + + + + + 26340 

ACCAGCTGCTCAGCCCATGCCTGTGGGGCTGCGCGGACCTCCGGCTGGAGCAGCAGCTGC 
20 VDE SGTDTPTRLEADLVVDV- 



TCACGGGGCGCGGCTCGCGGACTCCCGCCTGGCTGGAGGAGTTCGGATACGAGCGGCCCG 
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26341 + + + + + + 26400 

AGTGCCCCGCGCCGAGCGCCTGAGGGCGGACCGACCTCCTCAAGCCTATGCTCGCCGGGC 
20 TGRGSRTPAWLEEFGYERPA- 

CGGAGGACCGCTTCAAGATCGATCTGGCGTACACCACGCGCCACTTCAAGCTCAAGGAAG 

26401 + + + + + + 26460 

GCCTCCTGGCGAAGTTCTAGCTAGACCGCATGTGGTGCGCGGTGAAGTTCGAGTTCCTTC 
20 EDRFKIDLAYTTRHFKLKED- 

ACCCCTACGGCACGGACCTGTCGATCAACCCGGTGGCATCGCCGAGCAACCCGCGCGGCG 

26461 + + + + + + 26520 

TGGGGATGCCGTGCCTGGACAGCTAGTTGGGCCACCGTAGCGGCTCGTTGGGCGCGCCGC 
20 PYGTDLS I NPVAS PSNPRGA- 

CGTTCTTCCCCCGGCTCGCGGACGGCAGCTCCCAGCTCTCCCTCACCGGAATCCTCGGCG 

26521 + + + + + + 26580 

GCAAGAAGGGGGCCGAGCGCCTGCCGTCGAGGGTCGAGAGGGAGTGGCCTTAGGAGCCGC 
20 FFPRLADGSSQLSLTGILGD- 

ACCACCCGCCCACCGACGACGAGGGCTTCCTGGCGTTCGCCAAGTCGCTTGCCGCGCCGG 

26581 + + + + + + 26640 

TGGTGGGCGGGTGGCTGCTGCTCCCGAAGGACCGCAAGCGGTTCAGCGAACGGCGCGGCC 
20 HP PTDDEGFLAFAKSLAAPE- 

AGATCTACCGGGCCGTCCGCGATGCCGAACCTCTCGACGAACCGGTCACCTTCCGCTTCC 

26641 + + + + + + 26700 

TCTAGATGGCCCGGCAGGCGCTACGGCTTGGAGAGCTGCTTGGCCAGTGGAAGGCGAAGG 
20 IYRAVRDAEPLDEPVTFRFP- 

CGGCGAGCGTCCGCCGCCGTTACGAGAGGCTGCGCCGTTTCCCCGGCGGGTTCCTCGTCA 

26701 + + + + + + 26760 

GCCGCTCGCAGGCGGCGGCAATGCTCTCCGACGCGGCAAAGGGGCCGCCCAAGGAGCAGT 
20 ASVRRRYERLRRFPGGFLVM- 

TGGGCGACGGCGTGTGCAGCTTCAACCCCGTCTACGGCCAGGGCATGACGGTCGCCGCCC 

26761 + + + + + + 26820 

ACCCGCTGCCGCACACGTCGAAGTTGGGGCAGATGCCGGTCCCGTACTGCCAGCGGCGGG 
20 GDGVC S FN PVYGQGMTVAAL- 

TGGAGGCCGTGGCGCTGCGGGACCACTTGCGCGACGCCCCGGACCCCGACGCCCTGCGCT 

26821 + + + + + + 26880 

ACCTCCGGCACCGCGACGCCCTGGTGAACGCGCTGCGGGGCCTGGGGCTGCGGGACGCGA 
20 EAVALRDHLRDAPD PDALRF- 

TCTTCCGGCGTATCTCCACGGTCATCGACGTTCCGTGGGACATCGCCGCCGGAGCGGATC 

26881 + + + + + + 26940 

AGAAGGCCGCATAGAGGTGCCAGTAGCTGCAAGGCACCCTGTAGCGGCGGCCTCGCCTAG 
20 FRRI STVI DVPWD IAAGADL- 

TGAACTTCCCCGGGGTGGAGGGCCCCCGCACC71TGAAGGTGAAGATGGCCAACGCCTACA 

26941 + + + + + + 27000 

ACTTGAAGGGGCCCCACCTCCCGGGGGCGTGGTACTTCCACTTCTACCGGTTGCGGATGT 
20 NF PGVEG PRTMKVKMANAYM- 

TGGCCCGCCTGCACGCAGCGGCAGCCGTCGACGGCGCGGTGACCGGGGCGTTCTTCCGGG 

27001 + + + + + + 27060 

ACCGGGCGGACGTGCGTCGCCGTCGGCAGCTGCCGCGCCACTGGCCCCGCAAGAAGGCCC 
20 ARLHAAAAVDGAVTGAFFRV- 

TGGCCGGGCTGGTGGACCCCCCGCAGGCCCTGATGCGCCCCTCCCTCGCCCTGCGGGTCA 

27061 + + + + + + 27120 

ACCGGCCCGACCACCTGGGGGGCGTCCGGGACTACGCGGGGAGGGAGCGGGACGCCCAGT 
20 AGLVD PPQALMRPSLALRVM- 

TGCGCAACTCCTCGGCGAAGCCGTCGGTCCCTTCGGGCGCCGCCGTATGACCGCGCGGCC 
27121 + + + + + + 27180 
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ACGCGTTGAGGAGCCGCTTCGGCAGCCAGGGAAGCCCGCGGCGGCATACTGGCGCGCCGG 
20-* RNSSAKPSVPSGAAV*- 

CGTCCGGGGCGGCTGCCGGGGCCAGGAGCCGACATGCGGGTGATGATCACGGTGTTCCCG 

27181 + + + + + + 27240 

GCAGGCCCCGCCGACGGCCCCGGTCCTCGGCTGTACGCCCACTACTAGTGCCACAAGGGC 
19 MRVMITVFP 

GCGCGGGCGCACTTCCTGCCGCTGGTGCCCTATGCCTGGGCCCTGCAGAGCGCGGGCCAC 

27241 + + + + + + 27300 

CGCGCCCGCGTGAAGGACGGCGACCACGGGATACGGACCCGGGACGTCTCGCGCCCGGTG 
19 ARAHFLPLVPYAWALQSAGH 

GAGGTATGTGTCGTGGCGCCCCCGGGCTATCCCACCGGGGTGGCCGACCCCGACTTCCAC 

27301 + + + + + + 27360 

CTCCATACACAGCACCGCGGGGGCCCGATAGGGTGGCCCCACCGGCTGGGGCTGAAGGTG 
19 EVCVVAP PGYPTGVAD PDFH 

GAGGCCGTCACCGCGGCCGGCCTGAAGTCGGTGACCTGCGGGCAGCCGCAGCCGCTGGCG 

27361 + + + + + + 27420 

CTCCGGCAGTGGCGCCGGCCGGACTTCAGCCACTGGACGCCCGTCGGCGTCGGCGACCGC 
19 EAVTAAGLKSVTCGQ PQPLA 

GTCCT^CGACCGCGACGACCCCGGCTACGCGGCGATGCTGCCGACCGCGGCGGAGTCGGAG 

27421 + + + + + + 27480 

CAGGTGCTGGCGCTGCTGGGGCCGATGCGCCGCTACGACGGCTGGCGCCGCCTCAGCCTC 
19 VHDRDDPGYAAMLPTAAESE 

CGCTACGTGGCGGCCCTCGGGATCAGCGAGAAGGAGCGCCCCACCTGGGACGTCTTCTAC 

27481 + + + + + + 27540 

GCGATGCACCGCCGGGAGCCCTAGTCGCTCTTCCTCGCGGGGTGGACCCTGCAGAAGATG 
19 RYVAALG I SEKERPTWDVFY 

CACTTCACCTTGCTGGCGATCCGCGACTACCATCCGCCGCGGCCGCGGCAGGACGTGGAC 

27541 + + + + + + 27600 

GTGAAGTGGAACGACCGCTAGGCGCTGATGGTAGGCGGCGCCGGCGCCGTCCTGCACCTG 
19 HFTLLAI RDYHPPRPRQDVD 

CAGGTGATCGAGTTCGCCCGGATCTGGCAGCCCGATCTGGTGCTGTGGGACGCCTGGTTC 

27601 + + + + + + 27660 

GTCCACTAGCTCAAGCGGGCCTAGACCGTCGGGCTAGACCACGACACCCTGCGGACCAAG 
19 QVIEFARIWQPDLVLWDAWF 

CCCTCGGGCGCGATCGCGGCGCGGGTCAGCGGCGCCGCGCACGCGCGGGTGCTCGTAGCC 

27661 + + + + + + 27720 

GGGAGCCCGCGCTAGCGCCGCGCCCAGTCGCCGCGGCGCGTGCGCGCCCACGAGCATCGG 
19 PSGAIAARVSGAAHARVLVA 

CCCGACTACACCGGCTGGGTCACCGAGCGGTTCGCCGCCGCGGGCCCCGCGGCGGGGGCC 

27721 + + + -- + + + 27780 

GGGCTGATGTGGCCGACCCAGTGGCTCGCCAAGCGGCGGCGCCCGGGGCGCCGCCCCCGG 
19 PDYTGWVTERFAAAGPAAGA 

GACCTCCTGGCCGAGACGATGCGGCCGCTGGCCGAGCGGTACGGCGTGGAGGTCGACGAC 

27781 + + + + - + + 27840 

CTGGAGGACCGGCTCTGCTACGCCGGCGACCGGCTCGCCATGCCGCACCTCCAGCTGCTG 
19 DLLAETMRPLAERYGVEVDD 

GATCTTCTGCTCGGACAGTGGACGGTCAATCCGTTCCCGGCGCCGATGAACCCGCCGACC 

27841 + + + + + + 27900 

CTAGAAGACGAGCCTGTCACCTGCCAGTTAGGCAAGGGCCGCGGCTACTTGGGCGGCTGG 
19 DLLLGQWTVNPFPAPMNPP T 

CGGCTCACGAACGTTCCGGTGCGCTACGTGCCCTACACCGGTGCCAGCGTCATGCCCGCG 

27901 + + + + + + 27960 

GCCGAGTGCTTGCAAGGCCACGCGATGCACGGGATGTGGCCACGGTCGCAGTACGGGCGC 
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19 RLTNVPVRYVPYTGASVMPA 

TGGCTGTACGCGCGGCCGTCGCGGCCGCGGGTGGCGCTGTCGCTCGGAGTGTCCGCGCGG 

27961 + + + + + + 28020 

ACCGACATGCGCGCCGGCAGCGCCGGCGCCCACCGCGACAGCGAGCCTCACAGGCGCGCC 
19 WLYARPSRPRVALSLGVSAR 

GCGTTCCTCAAGGGTGACTGGGGGCGTACCGCCAAACTGCTGGAAGCGGTCGCGGAGCTG 

28021 + + + + + + 28080 

CGCAAGGAGTTCCCACTGACCCCCGCATGGCGGTTTGACGACCTTCGCCAGCGCCTCGAC 
19 AFLKGDWGRTAKLLEAVAEL 

GACATCGAGGTGATCGCCACGCTCAACGACAACCAACTGGCGGAGAGCGGGCCGCTGCCG 

28081 + + + + + + 28140 

CTGTAGCTCCACTAGCGGTGCGAGTTGCTGTTGGTTGACCGCCTCTCGCCCGGCGACGGC 
19 DIEVIATLNDNQLAESGPLP 

GACAACGTCCACACCCTCGACTACGTACCGCTCGACCAGTTGCTGCCCACCTGCTCGGCC 

28141 + + + + + + 28200 

CTGTTGCAGGTGTGGGAGCTGATGCATGGCGAGCTGGTCAACGACGGGTGGACGAGCCGG 
19 DNVHTLDYVPLDQLLPTCSA 

GTCATCCACCACGGATCGACGGGCACCTTCGCCGCGGCGAGCGCGGCCGGGCTGCCCCAG 

28201 + + + + + + 28260 

CAGTAGGTGGTGCCTAGCTGCCCGTGGAAGCGGCGCCGCTCGCGCCGGCCCGACGGGGTC 
19 VIHHGSTGTFAAASAAGLPQ 

GTGGTCTGCGACACCGACGAGCCCCTCCTGCTCTTCGGCGAGGACACCCCCGACGGCATC 

28261 - + + + + + + 28320 

CACCAGACGCTGTGGCTGCTCGGGGAGGACGAGAAGCCGCTCCTGTGGGGGCTGCCGTAG 
19 VVCDTDEPLLLFGEDTPDGI 

GCGTGGGACTTCACCTGCCAGAAGCAGCTCACCGCGACGCTCACCTCCCGCGTGGTCACC 

28321 + + + + + + 28380 

CGCACCCTGAAGTGGACGGTCTTCGTCGAGTGGCGCTGCGAGTGGAGGGCGCACCAGTGG 
19 AWDFTCQKQLTATLTSRVVT 

GACTACGGGGCGGGGGTGCGCGTCGACCACCAGAAGCAGTCCGCCGGACAGATCCGTGAG 

28381 + + + + + + 28440 

CTGATGCCCCGCCCCCACGCGCAGCTGGTGGTCTTCGTCAGGCGGCCTGTCTAGGCACTC 
19 DYGAGVRVDHQKQSAGQ I RE 

CAACTACGCAGGGTGCTCACCGAACCTTCCTTCCGCGAGGGCGCTCGACGGATCCGGGAA 

28441 + + + + + + 28500 

GTTGATGCGTCCCACGAGTGGCTTGGAAGGAAGGCGCTCCCGCGAGCTGCCTAGGCCCTT 
19 QLRRVLTE PSFREGARRIRE 

GACCGGAATTCCGCCCCCAGCCCGGTCGAACTCGTATCGCTCCTGGTAGAACTGACGAAG 

28501 + + + + + + 28560 

CTGGCCTTAAGGCGGGGGTCGGGCCAGCTTGAGCATAGCGAGGACCATCTTGACTGCTTC 
19 DRNSAPS PVELVSLLVELTK 

CGTCATCGCCGTGACAAGGAGGCGGACCGATGAGGATGCTGGTGACGGGCGGAGCGGGTT 

28561 + + + + + + 28620 

GCAGTAGCGGCACTGTTCCTCCGCCTGGCTACTCCTACGACCACTGCCCGCCTCGCCCAA 
19-* RHRRDKEADR * - 

l-> MRMLVTGGAG F- 

TCATCGGCTCGCAGTTCGTGCGGGCCACACTGCACGGCGAGCTGCCGGGTTCCGAGGACG 

28621 + + + + + + 28680 

AGTAGCCGAGCGTCAAGCACGCCCGGTGTGACGTGCCGCTCGACGGCCCAAGGCTCCTGC 
1 I GSQFVRATLHGELPGS EDA- 

CCCGGGTGACGGTCCTGGACAAGCTGACGTACTCCGGCAATCCGGCCAACCTCACCTCCG 

28681 + + + + + + 28740 

GGGCCCACTGCCAGGACCTGTTCGACTGCATGAGGCCGTTAGGCCGGTTGGAGTGGAGGC 
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RVTVLDKLTYSGNPANLTSV- 

TCGCGGCCCATCCGCGGTACACCTTCGTCCAGGGCGACACCGTCGACCCGCGCGTCGTCG 

28741 + + + + + + 28800 

AGCGCCGGGTAGGCGCCATGTGGAAGCAGGTCCCGCTGTGGCAGCTGGGCGCGCAGCAGC 
AAHPRYTFVQGDTVDPRVVD- 

ACGAGGTGGTCGCCGGCCACGACGTCATCGTCCACTTCGCGGCGGAGTCGCACGTGGACC 

28801 + + + + + + 28860 

TGCTCCACCAGCGGCCGGTGCTGCAGTAGCAGGTGAAGCGCCGCCTCAGCGTGCACCTGG 
EVVAGHDV I VHFAAE S HVDR- 

GCTCGATCGACACCGCCACCCGGTTCGTCACGACCAACGTGCTCGGGACCCAGACGCTGC 

28861 + + + + + + 28920 

CGAGCTAGCTGTGGCGGTGGGCCAAGCAGTGCTGGTTGCACGAGCCCTGGGTCTGCGACG 
S I DTATRFVTTNVLGTQTLL- 

TGGAAGCGGCTCTCCGGCACGGGGTCGGCCGGTTCGTGCACGTGTCGACCGACGAGGTCT 

28921 + + + + + + 28980 

ACCTTCGCCGAGAGGCCGTGCCCCAGCCGGCCAAGCACGTGCACAGCTGGCTGCTCCAGA 
EAALRHGVGRFVHVSTDEVY- 

ACGGGTCGATCGCCTCCGGCTCATGGACCGAGGACACCCCGCTCGCCCCCAACGTCCCCT 

28981 + + + + + + 29040 

TGCCCAGCTAGCGGAGGCCGAGTACCTGGCTCCTGTGGGGCGAGCGGGGGTTGCAGGGGA 
GS IASGSWTEDTPLAPNVPY- 

ACGCGGCGTCGAAGGCGGGTTCGGACCTGATGGCGCTCGCCTGGCACCGCACCCGGGGCC 

29041 + + + + + + 29100 

TGCGCCGCAGCTTCCGCCCAAGCCTGGACTACCGCGAGCGGACCGTGGCGTGGGCCCCGG 
AAS KAGS DLMALAWHRTRGL- 

TGGACGTCGTCGTCACCCGGTGCACCAACAACTACGGTCCCTACCAGTACCCCGAGAAGG 

29101 + + + + + + 29160 

ACCTGCAGCAGCAGTGGGCCACGTGGTTGTTGATGCCAGGGATGGTCATGGGGCTCTTCC 
DVVVTRCTNNYGPYQY PEKV- 

TGATCCCGCTCTTCGTCACCAACATCCTCGACGGCTTGCGGGTGCCCCTGTACGGGGACG 

29161 + + + + + + 29220 

ACTAGGGCGAGAAGCAGTGGTTGTAGGAGCTGCCGAACGCCCACGGGGACATGCCCCTGC 
I PLFVTN I LDGLRVPLYGDG- 

GCGCCCACCGCCGGGACTGGCTGCACGTGTCCGACCACTGCCGGGCCATCCAGATGGTCA 

29221 + + + + + + 29280 

CGCGGGTGGCGGCCCTGACCGACGTGCACAGGCTGGTGACGGCCCGGTAGGTCTACCAGT 
AHRRDWLHVSDHCRA I QMVM- 

TGAACTCCGGCCGGGCCGGGGAGGTCTACCACATCGGCGGCGGCACCGAACTCTCCAACG 

29281 + + + + + + 29340 

ACTTGAGGCCGGCCCGGCCCCTCCAGATGGTGTAGCCGCCGCCGTGGCTTGAGAGGTTGC 
NSGRAGEVYH IGGGTELSNE- 

AGGAACTCACCGGCCTGTTGCTCACGGCGTGCGGCACCGACTGGTCCTGCGTGGACCGGG 

29341 + + + + + + 29400 

TCCTTGAGTGGCCGGACAACGAGTGCCGCACGCCGTGGCTGACCAGGACGCACCTGGCCC 
ELTGLLLTACGTDWS CVDRV- 

TGGCCGACCGGCAGGGGCACGACCGCCGCTACTCGCTCGACATCACGAAGATCCGGCAGG 

29401 + + + + + + 29460 

ACCGGCTGGCCGTCCCCGTGCTGGCGGCGATGAGCGAGCTGTAGTGCTTCTAGGCCGTCC 
ADRQGHDRRYSLD I TKI RQE- 

AACTGGGCTACGAGCCCCTGGTCGCCTTCGAGGACGGCCTGGCCGCGACGGTGAAGTGGT 

29461 + + + + + + 29520 

TTGACCCGATGCTCGGGGACCAGCGGAAGCTCCTGCCGGACCGGCGCTGCCACTTCACCA 
LGYE PLVAFEDGLAATVKWY- 
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ACCACGAGAACCGTTCGTGGTGGCAGCCGCTGAAGGAAGCGGCCGGCCTCCTGGACGCCG 

29521 + + + + + + 29580 

TGGTGCTCTTGGCAAGCACCACCGTCGGCGACTTCCTTCGCCGGCCGGAGGACCTGCGGC 

1 HENRSWWQPLKEAAGLLDAV- 

TCGGCTGACGGCAGCCACCGCTAGGAACACCCCAGGAAAGGAGCCACCTCCGTGACAGCA 
29581 + + + + + + 29640 

AGCCGACTGCCGTCGGTGGCGATCCTTGTGGGGTCCTTTCCTCGGTGGAGGCACTGTCGT 
2-> M T A 

1-* G * - 

GTCAAGGAGCCGACGTCCCGCGCAGGACGGCGGGAGTGGATCGCTCTCGTCGTCCTCTCC 

29641 + + + + + + 29700 

CAGTTCCTCGGCTGCAGGGCGCGTCCTGCCGCCCTCACCTAGCGAGAGCAGCAGGAGAGG 

2 VKEPTSRAGRREWIALVVLS 

TTGCCCACGATGCTGTTGATGCTGGACATCAACGTCCTCATGCTGGCCTTGCCGCAGTTG 

29701 + + + + + + 29760 

AACGGGTGCTACGACAACTACGACCTGTAGTTGCAGGAGTACGACCGGAACGGCGTCAAC 
2 LPTMLLMLDINVLMLALPQL 

AGCGAGGATCTCGGCGCGAGCAGCACGCAACAGCTGTGGATCACCGACATCTACGGATTC 

29761 + + + + + + 29820 

TCGCTCCTAGAGCCGCGCTCGTCGTGCGTTGTCGACACCTAGTGGCTGTAGATGCCTAAG 
2 SEDLGASSTQQL'WITDIYGF 

GCGATCGCCGGCTTCCTGGTGACCATGGGCACCCTCGGCGACCGGATCGGCCGCCGCAGG 

29821 + + + + + + 29880 

CGCTAGCGGCCGAAGGACCACTGGTACCCGTGGGAGCCGCTGGCCTAGCCGGCGGCGTCC 
2 AIAGFLVTMGTLGDRIGRRR 

CTCCTGCTCGGGGGCGCGGCCGTCTTCGCGGTCGTGTCCGTCGTCGCCGCGTTCTCCGAC 

29881 + - + + + + + 29940 

GAGGACGAGCCCCCGCGCCGGCAGAAGCGCCAGCACAGGCAGCAGCGGCGCAAGAGGCTG 
2 LLLGGAAVFAVVSVVAAFSD 

AGCGCGGCGATGCTCGTCGTCAGCCGCGCCGTGCTCGGCGTCGCCGGGGCCACGGTGATG 

29941 + + + + + + 30000 

TCGCGCCGCTACGAGCAGCAGTCGGCGCGGCACGAGCCGCAGCGGCCCCGGTGCCACTAC 
2 SAAMLVVSRAV LGVAGATVM 

CCCTCGACGCTCGCGCTCATCAGCAACATGTTCGAGGACCCCAAGGAGCGGGGCACCGCC 

30001 + + + + + + 30060 

GGGAGCTGCGAGCGCGAGTAGTCGTTGTACAAGCTCCTGGGGTTCCTCGCCCCGTGGCGG 
2 PSTLALISNMFED PKERGTA 

ATCGCCATGTGGGCGAGCGCCATGATGGCCGGAGTCGCCCTCGGGCCCGCCGTCGGCGGC 

30061 + + + + + + 30120 

TAGCGGTACACCCGCTCGCGGTACTACCGGCCTCAGCGGGAGCCCGGGCGGCAGCCGCCG 
2 IAMWASAMMAGVALGPAVGG 

CTGGTCCTCGCCGCGTTCTGGTGGGGATCGGTGTTCCTCATCGCCGTTCCGGTGATGCTG 

30121 + + + - + + + 30180 

GACCAGGAGCGGCGCAAGACCACCCCTAGCCACAAGGAGTAGCGGCAAGGCCACTACGAC 
2 LVLAAFWWGSVFL IAVPVML 

CTGGTGGTGGTCACCGGCCCCGTGCTGCTCACCGAGTCCCGCGACCCGGACGCCGGACGG 

30181 + + + + + + 30240 

GACCACCACCAGTGGCCGGGGCACGACGAGTGGCTCAGGGCGCTGGGCCTGCGGCCTGCC 
2 LVVVTGPVLLTESRDPDAGR 

CTGGACCTGCTGAGCGCGGGGCTCTCCCTCGCGACCGTGCTGCCGGTGATCTACGGACTG 

30241 + + + + + + 30300 

GACCTGGACGACTCGCGCCCCGAGAGGGAGCGCTGGCACGACGGCCACTAGATGCCTGAC 
2 LDLLSAGLSLATVLPVIYGL 
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AAGGAGCTGGCCCGGACCGGGTGGGACCCGCTCGCCGCCGGCGCGGTGGTCCTCGGCGTG 

30301 + + + + + + 30360 

TTCCTCGACCGGGCCTGGCCCACCCTGGGCGAGCGGCGGCCGCGCCACCAGGAGCCGCAC 
KELARTGWDPLAAGAVVLGV 

ATCTTCGGCGCGCTGTTCGTCCAGCGCCAGCGGCGGTTGGCCGACCCCATGCTGGACCTC 

30361 + + + --, + + + 30420 

TAGAAGCCGCGCGACAAGCAGGTCGCGGTCGCCGCCAACCGGCTGGGGTACGACCTGGAG 
IFGALFVQRQRRLADPMLDL 

GGCCTCTTCGCCGACCGCACCCTGCGGGCGGGTCTGACGGTCAGTCTGGTCAACGCCGTC 

30421 + + + + + + 30480 

CCGGAGAAGCGGCTGGCGTGGGACGCCCGCCCAGACTGCCAGTCAGACCAGTTGCGGCAG 
GLFADRTLRAGLTVS LVNAV 

ATCATGGGCGGGACCGGACTGATGGTCGCCCTGTACCTCCAGACGATCGCCGGTCACTCC 

30481 + + + + + + 30540 

TAGTACCCGCCCTGGCCTGACTACCAGCGGGACATGGAGGTCTGCTAGCGGCCAGTGAGG 
I MGGTGLrMVALYLQT IAGHS 

CCGTTGGCCGCCGGGCTGTGGCTGCTGATCCCGGCCTGCATGCTCGTCGTGGGCGTACAG 

30541 + + + + + + 30600 

GGCAACCGGCGGCCCGACACCGACGACTAGGGCCGGACGTACGAGCAGCACCCGCATGTC 
PLAAGLWLLiI PACMLVVGVQ 

CTGTCGAACCTGCTGGCCCAGCGGATGCCCCCTTCCCGGGTGCTGCTGGGGGGACTGCTG 

30601 + + + + + + 30660 

GACAGCTTGGACGACCGGGTCGCCTACGGGGGAAGGGCCC7VCGACGACCCCCCTGACGAC 
LSNLLAQRMPPSRVLLGGLL 

ATCGCGGCCGTCGGACAGCTCCTGATCACCCAGGTGGACACCGAGGACACCGCCCTCCTC 

30661 + + + + + + 30720 

TAGCGCCGGCAGCCTGTCGAGGACTAGTGGGTCCACCTGTGGCTCCTGTGGCGGGAGGAG 
IAAVGQLLITQVDTEDTALL 

ATCGCGGCCACCACCCTGATCTACTTCGGCGCCTCACCGGTGGGGCCGATCACCACGGGC 

30721 + + + + + + 30780 

TAGCGCCGGTGGTGGGACTAGATGAAGCCGCGGAGTGGCCACCCCGGCTAGTGGTGCCCG 
IAATTLIYFGASPVGP ITTG 

GCGATCATGGGAGCCGCGCCCCCGGAGAAGGCGGGTGCCGCCTCGTCGCTGTCCGCCACC 

30781 + + + + + + 30840 

CGCTAGTACCCTCGGCGCGGGGGCCTCTTCCGCCCACGGCGGAGCAGCGACAGGCGGTGG 
AIMGAAPPEKAGAASSLSAT 

GGCGGCGAGTTCGGAGTGGCGCTCGGCATCGCGGGCCTGGGGAGTCTGGGCACCGTCGTG 

30841 + + + + + + 30900 

CCGCCGCTCAAGCCTCACCGCGAGCCGTAGCGCCCGGACCCCTCAGACCCGTGGCAGCAC 
GGEFGVALGIAGLGSLGTVV 

TACAGCGCCGGGGTCGAGGTGCCGGACGCGGCCGGGCCCGCCGACGCCGACGCCGCGCAG 

30901 + + + + + + 30960 

ATGTCGCGGCCCCAGCTCCACGGCCTGCGCCGGCCCGGGCGGCTGCGGCTGCGGCGCGTC 
YSAGVEVPDAAGPADADAAQ 

GAGAGCATCGCCGGCGCCCTGCACACGGCCGGTCAGCTGGCACCGGGCAGCGCCGACGCC 

30961 + + + + + + 31020 

CTCTCGTAGCGGCCGCGGGACGTGTGCCGGCCAGTCGACCGTGGCCCGTCGCGGCTGCGG 
ESIAGALHTAGQLAPGSADA 

CTGCTGGACTCCGCGCGCGCGGCCTTCACCAGCGGCGTGCAGTCCGTCGCCGCCGTCTGC 

31021 + + + + + + 31080 

GACGACCTGAGGCGCGCGCGCCGGAAGTGGTCGCCGCACGTCAGGCAGCGGCGGCAGACG 
LLDSARAAFTSGVQSVAAVC 
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GCCGTGTTCTCCCTGGCGCTCGCCGTCCTCATCGGCACCCGGCTGCGGGACATTTCCGCG 

31081 + - + + + + + 31140 

CGGCACAAGAGGGACCGCGAGCGGCAGGAGTAGCCGTGGGCCGACGCCCTGTAAAGGCGC 

2 AVFSLALAVLIGTRLRDISA 

ATGGACCACGGGCACGGCGAGGAACCGGCCGAGAACGACGCTCAACCGGCCACATGAGCG 

31141 + + + + + + 31200 

TACCTGGTGCCCGTGCCGCTCCTTGGCCGGCTCTTGCTGCGAGTTGGCCGGTGTACTCGC 

2- * MDHGHGEE PAENDAQPAT * - 

CACTTCCGGAGATGCAACGGCCGCCGTCGAGGTATGAGGATCACCTTCCGGGGTGCACCT 

31201 + + + + + + 31260 

GTGAAGGCCTCTACGTTGCCGGCGGCAGCTCCATACTCCTAGTGGAAGGCCCCACGTGGA 

GCACGGCAACGGAGGCGTAGTGGAGTACTGGAACAGCACGGCGGAGACCATGCCCCGCCA 

31261 + + + -- + + + 31320 

CGTGCCGTTGCCTCCGCATCACCTCATGACCTTGTCGTGCCGCCTCTGGTACGGGGCGGT 

3- > MEYWNSTAETMPRQ- 

GGAACTCGAACAGTGGAAGTGGCGCAGGCTCCAGGCCGCCATGGACCACGCCAGAAGGCT 

31321 + + + + + + 31380 

CCTTGAGCTTGTCACCTTCACCGCGTCCGAGGTCCGGCGGTACCTGGTGCGGTCTTCCGA 

3 ELEQWKWRRLQAAMDHARRL 

TTCGCCCTTCTGGCGGGAACGACTCCCCGAGAACATCACCTCCATGGCGGACTACGCGGC 

31381 + + + + + + 31440 

AAGCGGGAAGACCGCCCTTGCTGAGGGGCTCTTGTAGTGGAGGTACCGCCTGATGCGCCG 
3 SPFWRERLPENITSMADYAA- 

GCGGGTGCCTCTCCTGCGCAAGGCCGACCTCCTCGCCGCGGAAGCCGCGTCTCCCCCTTA 

31441 + + + + + + 31500 

CGCCCACGGAGAGGACGCGTTCCGGCTGGAGGAGCGGCGCCTTCGGCGCAGAGGGGGAAT 
3 RVPLLRKADLLAAEAAS PPY 

CGGCACCTGGCCCTCGCTGGATCCGGCGCTCGGAGTGCGCCATCACCAGACCAGCGGCAC 

31501 + + + + + + 31560 

GCCGTGGACCGGGAGCGACCTAGGCCGCGAGCCTCACGCGGTAGTGGTCTGGTCGCCGTG 
3 GTWPSLDPALGVRHHQTSGT- 

CAGCGGTAACCCCCCCATCCGGACGTTCGACACCGAACGCGACTGGGCCTGGTGCGTGGA 

31561 + + + + + + 31620 

GTCGCCATTGGGGGGGTAGGCCTGCAAGCTGTGGCTTGCGCTGACCCGGACCACGCACCT 
3 SGNPP I-RTFDTERDWAWCVD 

CACGTTCTGCACGGCGCTCCACAGCATGGGCGTGCGCCCGCACCACT^AGGGTCTGGTGGC 

31621 + + + + + + 31680 

GTGCAAGACGTGCCGCGAGGTGTCGTACCCGCACGCGGGCGTGGTGTTCCCAGACCACCG 
3 TFCTALHSMGVRPHHKGLVA 

GTTCGGCTACGGGCTGTTCGCCGGTTTCTGGGGCATGCACTACGGCCTCGAGCGCATGGG 

31681 + + + + + + 31740 

CAAGCCGATGCCCGACAAGCGGCCAAAGACCCCGTACGTGATGCCGGAGCTCGCGTACCC 
3 FGYGLFAGFWGMHYG LERMG 

CGCCACGGTCATCCCGGCCGGCGGCCTCGACTCCCGCTCCCGGGTACGGCTGCTGGTCGA 

31741 + + + + + + 31800 

GCGGTGCCAGTAGGGCCGGCCGCCGGAGCTGAGGGCGAGGGCCCATGCCGACGACCAGCT 
3 ATVI PAGGLDSRSRVRLLVD- 

CTACCAGATCGAGGTGCTCGGCCTCACACCGAGCTATGCGATGCGGCTGATCGAGACGGC 

31801 + + + + + + 31860 

GATGGTCTAGCTCCACGAGCCGGAGTGTGGCTCGATACGCTACGCCGACTAGCTCTGCCG 
3 YQIEVLGLTPSYAMRLIETA- 

CCGCGAGATGGGCATCGACCTCGCCCGCGAGGCTAACGTCCAGATCATCCTGGCCGGGGC 
31861 + + + + + + 31920 
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GGCGCTCTACCCGTAGCTGGAGCGGGCGCTCCGATTGCAGGTCTAGTAGGACCGGCCCCG 
3 REMGIDLAREANVQ I I LAGA 

GGAGCCGCGCTCCGCGTTCACCACCCGCACCATCGAGGAGGCCTTCGGCGCCCGGGTCTT 

31921 + + + + + + 31980 

CCTCGGCGCGAGGCGCAAGTGGTGGGCGTGGTAGCTCCTCCGGAAGCCGCGGGCCCAGAA 
3 EPRSAFTTRTIEEAFGARVF 

CAACGCCGCGGGCACCACTGAGTTCGGGGGGGTGTTCATGTTCGAGTGCACCGCCCGGCG 

31981 + + + + + + 32040 

GTTGCGGCGCCCGTGGTGACTCAAGCCCCCCCACAAGTACAAGCTCACGTGGCGGGCCGC 
3 NAAGTTE FGGVFMFECTARR 

CGAGGCCTGCCACATCATCGAACCCTCGTGCATCGAGGAGGTGCTCGACCCGGTGACGGA 

32041 + + + + + + 32100 

GCTCCGGACGGTGTAGTAGCTTGGGAGCACGTAGCTCCTCCACGAGCTGGGCCACTGCCT 
3 EACHIIEPSCIEEVLDPVTE- 

ACAGCCCGTCGGCTACGGCGAGGAGGGCGTCCGAGTCACCACCGGGCTGAACCGTGAGGG 

32101 + + + + + + 32160 

TGTCGGGCAGCCGATGCCGCTCCTCCCGCAGGCTCAGTGGTGGCCCGACTTGGCACTCCC 
3 QPVGYGEEGVRVTTGLNREG- 

GATGCAGCTCTTCCGGCACTGGACCGAGGACGTCGTGGTCAAGCGGCCCCACACCGAGTG 

32161 + + + + + + 32220 

CTACGTCGAGAAGGCCGTGACCTGGCTCCTGCAGCACCAGTTCGCCGGGGTGTGGCTCAC 
3 MQL'FRHWTEDVVVKRPHTEC 

CGGCTGCGGCCGGACGTGGGACTTCTACGACGGCGGCATCCTTCGGCGCGTGGACGACAT 

32221 + + + + + + 32280 

GCCGACGCCGGCCTGCACCCTGAAGATGCTGCCGCCGTAGGAAGCCGCGCACCTGCTGTA 
3 GCGRTWDFYDGGI LRRVDDM 

GCGCAAGATACGCGGGGTCTCGATCACCCCGGTGATGATCGAGGATGTGCTGCGCGGCTT 

32281 + + + + + + 32340 

CGCGTTCTATGCGCCCCAGAGCTAGTGGGGCCACTACTAGCTCCTACACGACGCGCCGAA 
3 RKI RGVS ITPVMI EDVLRGF 

CGACGAGGTGAACGAGTTCCACTCGTCCATCCGGACCGTCCGCGGACTCGATACGATCCA 

32341 + + + + + + 32400 

GCTGCTCCACTTGCTCAAGGTGAGCAGGTAGGCCTGGCAGGCGCCTGAGCTATGCTAGGT 
3 DEVNEFHSS IRTVRGLDTIH- 

CGTCAAGGTCGAGGCGGGAGACATCTCGGGTGAGGCGGCCGAGAGCCTGTGCGGCCGCAT 

32401 + + + + + + 32460 

GCAGTTCCAGCTCCGCCCTCTGTAGAGCCCACTCCGCCGGCTCTCGGACACGCCGGCGTA 
3 VKVEAGDISGEAAESLCGRI- 

CACCGAGGAGTTCAAGCGTGAGATAGGCATACGGCCCCAGGTGGAGCTGACCCCCGCGGG 

32461 + + + + + + 32520 

GTGGCTCCTCAAGTTCGCACTCTATCCGTATGCCGGGGTCCACCTCGACTGGGGGCGCCC 
3 TE E FKRE I G I RPQVELT PAG 

CAGCCTCCCCCGATCGAAGTGGAAGGCGGCACGACTTCATGACGAGCGCGAACTCGCCCC 

32521 + + + + + + 32580 

GTCGGAGGGGGCTAGCTTCACCTTCCGCCGTGCTGAAGTACTGCTCGCGCTTGAGCGGGG 
3 SLPRSKWKAARLHDERELAP- 

TCAGGCCTGAGCAGGTGGAGCAGCTCCTGGTGAGCTACCGGAGCCTGGGCCTGCTGGAGC 

32581 + + + + + + 32640 

AGTCCGGACTCGTCCACCTCGTCGAGGACCACTCGATGGCCTCGGACCCGGACGACCTCG 
3-* Q A * - 

AGAGCTGCGCGGTCCCGGCCGTGCTCGCCGCGGTCAGGGCCGCCCGTGCGGAACTCCGTA 

32641 + + + + + + 32700 

TCTCGACGCGCCAGGGCCGGCACGAGCGGCGCCAGTCCCGGCGGGCACGCCTTGAGGCAT 
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TCGCCCTGGACGGCCAGGGCGTGGAGTTCGAGTACTACCGGGGGCACGACGACAGCCTCG 

32701 + + + + -- -- + + 32760 

AGCGGGACCTGCCGGTCCCGCACCTCAAGCTCATGATGGCCCCCGTGCTGCTGTCGGAGC 

. TGGCCTGAACCCACCCCCGGTCCGCCGGGTCAGACGAAAGGGAGACCGGTGCCCCACGGT 

32761 + + + + + + 32820 

ACCGGACTTGGGTGGGGGCCAGGCGGCCCAGTCTGCTTTCCCTCTGGCCACGGGGTGCCA 
4-> M P H G 

GCAGAGCGCGAAGCGAGCCCGGCCGAGGAGAGCGCCGGCACCCGGCCGCTGACCGGCGAG 

32821 + + + + + + 32880 

CGTCTCGCGCTTCGCTCGGGCCGGCTCCTCTCGCGGCCGTGGGCCGGCGACTGGCCGCTC 
4 AEREASPAEESAGTRPLTGE 

GAGTATCTGGAGAGCCTGCGGGACGCGCGGGAGGTGTACCTCGACGGCAGCCGCGTCAAG 

32881 + + + + + + 32940 

CTCATAGACCTCTCGGACGCCCTGCGCGCCCTCCACATGGAGCTGCCGTCGGCGCAGTTC 
4 EYLES LRDAREVYLDGSRVK 

GACGTCACCGCGCATCCCGCGTTCCACAACCCGGCCCGGATGACGGCCCGGCTGTACGAC 

32941 + + + + + + 33000 

CTGCAGTGGCGCGTAGGGCGCAAGGTGTTGGGCCGGGCCTACTGCCGGGCCGACATGCTG 
4 DVTAHPAFHNPARMTARLYD 

AGCCTGCACGACCCCGCCCAGAAAGCGGTCCTGACGGCGCCCACCGATGCCGGTGACGGT 

33001 + + + + + + 33060 

TCGGACGTGCTGGGGCGGGTCTTTCGCCAGGACTGCCGCGGGTGGCTACGGCCACTGCCA 
4 SLHDPAQKAVLTAPTDAGDG 

TTCACCCACCGCTTCTTCACCGCACCGCGCAGCGTCGACGACCTGGTCAAGGACCAGGCC 

33061 + + + + + + 33120 

AAGTGGGTGGCGAAGAAGTGGCGTGGCGCGTCGCAGCTGCTGGACCAGTTCCTGGTCCGG 
4 FTHRFFTAPRSVDDLVKDQA 

GCCATCGCATCCTGGGCGCGCAAGAGCTACGGCTGGATGGGGCGCAGCCCCGACTACAAG 

33121 + + + + + + 33180 

CGGTAGCGTAGGACCCGCGCGTTCTCGATGCCGACCTACCCCGCGTCGGGGCTGATGTTC 
4 AIASWARKSYGWMGRS PDYK 

GCGTCGTTCCTCGGCACGCTGGGGGCCAACGCCGACTTCTACGAGCCCTTCGCGGACAAC 

33181 + + + + + + 33240 

CGCAGCAAGGAGCCGTGCGACCCCCGGTTGCGGCTGAAGATGCTCGGGAAGCGCCTGTTG 
4 ASFLGTLGANADFYEPFADN 

GCCCGGCGCTGGTACCGGGAGTCGCAGGAGAAGGTGCTGTACTGGAAC CATGC CTTCCTT 

33241 + + - + + + + 33300 

CGGGCCGCGACCATGGCCCTCAGCGTCCTCTTCCACGACATGACCTTGGTACGGAAGGAA 
4 ARRWYRE SQEKVLYWNHAFL 

CACCCGCCGGTCGACCGCTCGCTGCCCGCCGACGAGGTGGGCGACGTCTTCATCCACGTC 

33301 + + + -- - + + + 33360 

GTGGGCGGCCAGCTGGCGAGCGACGGGCGGCTGCTCCACCCGCTGCAGAAGTAGGTGCAG 
4 HPPVDRSLPADEVGDVFIHV 

GAGCGGGAGACCGACGCGGGCCTGGTGGTGAGCGGGGCCAAGGTCGTCGCGACCGGATCG 

33361 + + + + + + 33420 

CTCGCCCTCTGGCTGCGCCCGGACCACCACTCGCCCCGGTTCCAGCAGCGCTGGCCTAGC 
4 ERETDAGLVVSGAKVVATGS 

GCCCTCACCCACGCGGCGTTCATCTCGCACTGGGGACTTCCCATCAAGGACCGGAAGTTC 

33421 + + + + + + 33480 

CGGGAGTGGGTGCGCCGCAAGTAGAGCGTGACCCCTGAAGGGTAGTTCCTGGCCTTCAAG 
4 ALTHAAFISHWGLPIKDRKF 

GCCCTGGTGGCCACCGTGCCGATGGACGCGGACGGCCTCAAGGTGATCTGCCGTCCCTCC 
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33481 + + + + + --- + 33540 

CGGGACCACCGGTGGCACGGCTACCTGCGCCTGCCGGAGTTCCACTAGACGGCAGGGAGG 
ALVATVPMDADGLKVI CRPS 

TACTCCGCAAACGCGGCGACCACGGGCAGCCCGTTCGACAACCCGCTGTCCTCACGGCTG 

33541 + + + + + + 33600 

ATGAGGCGTTTGCGCCGCTGGTGCCCGTCGGGCAAGCTGTTGGGCGACAGGAGTGCCGAC 
YSANAATTGS PFDNPLS S R L 

GACGAGAACGACGCCATCCTCGTACTCGACCAGGTGCTGATCCCCTGGGAGAACGTGTTC 

33601 + + + + + + 33660 

CTGCTCTTGCTGCGGTAGGAGCATGAGCTGGTCCACGACTAGGGGACCCTCTTGCACAAG 
DENDAILVLDQVLI PWENVF 

GTCTACGGCAACCTGGGCAAGGTACATCTCCTCGCCGGACAGTCCGGGATGATCGAACGC 

33661 + + + + + + 33720 

CAGATGCCGTTGGACCCGTTCCATGTAGAGGAGCGGCCTGTCAGGCCCTACTAGCTTGCG 
VYGNLGKVHL.LAGQSGMIER 

GCCACCTTCCACGGGTGCACCCGGCTCGCCGTGAAGCTGGAGTTCATCGCCGGGCTGCTG 

33721 + + + + + + 33780 

CGGTGGAAGGTGCCCACGTGGGCCGAGCGGCACTTCGACCTCAAGTAGCGGCCCGACGAC 
ATFHGCTRLAVKLEF IAGLL 

GCCAAGGCGCTGGACATCACCGGGGCGAAGGACTTCCGCGGTGTGCAGACCCGGCTCGGA 

33781 + + + + + + 33840 

CGGTTCCGCGACCTGTAGTGGCCCCGCTTCCTGAAGGCGCCACACGTCTGGGCCGAGCCT 
AKALDITGAKDFRGVQTRLG 

GAAGTCCTGGCCTGGCGCAACCTCTTCTGGTCACTGTCGGACGCGGCGGCCCGCAACCCC 

33841 + + + + + + 33900 

CTTCAGGACCGGACCGCGTTGGAGAAGACCAGTGACAGCCTGCGCCGCCGGGCGTTGGGG 
EVLAWRNLFWSLSDAAARNP 

GTCCCCTGGAAGAACGGCACGCTCCTGCCCAACCCTCAGGCGGGTATGGCCTACCGCTGG 

33901 + + + + + + 33960 

CAGGGGACCTTCTTGCCGTGCGAGGACGGGTTGGGAGTCCGCCCATACCGGATGGCGACC 
VPWKNGTLLPNPQAGMAYRW 

TTCATGCAGATCGGCTACCCGCGGGTCCTGGAGATCGTCCAACAGGACGTGGCCAGCGGC 

33961 + + + + + + 34020 

AAGTACGTCTAGCCGATGGGCGCCCAGGACCTCTAGCAGGTTGTCCTGCACCGGTCGCCG 
FMQIGYPRVLE IVQQDVASG 

CTCATGTACGTCAACTCCTCCACGGAGGACTTCCGCAACCCCGAGACCGGCCCCTACTTG 

34021 + + + + + + 34080 

GAGTACATGCAGTTGAGGAGGTGCCTCCTGAAGGCGTTGGGGCTCTGGCCGGGGATGAAC 
LMYVNSSTEDFRNPETGPYL 

GAGAAGTACCTCCGGGGCAGCGACGGCGCAGGCGCCGTCGAGCGTGTCAAGGTGATGAAG 

34081 + + + + + + 34140 

CTCTTCATGGAGGCCCCGTCGCTGCCGCGTCCGCGGCAGCTCGCACAGTTCCACTACTTC 
EKYLRGSDGAGAVERVKVMK 

CTGCTGTGGGACGCGGTGGGATCCGACTTCGGCGGCCGGCACGAACTCTACGAGCGGAAC 

34141 + + + + + + 34200 

GACGACACCCTGCGCCACCCTAGGCTGAAGCCGCCGGCCGTGCTTGAGATGCTCGCCTTG 
LLWDAVGSDFGGRHELYERN 

TACTCCGGGAACCACGAGAACACCCGGATCGAGTTGCTGCTGTCGCAGACGGCGAGCGGC 

34201 + + + + + + 34260 

ATGAGGCCCTTGGTGCTCTTGTGGGCCTAGCTCAACGACGACAGCGTCTGCCGCTCGCCG 
YS GNHENTRIELLLSQTASG 

AAACTGGACTCGTACATGGACTTCGCCCAGGCATGCATGGACGAGTACGACCTGGACGGC 
34261 + + + + + + 34320 
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TTTGACCTGAGCATGTACCTGAAGCGGGTCCGTACGTACCTGCTCATGCTGGACCTGCCG 
4 KLDSYMDFAQACMDEYDLDG 

TGGACCGCTCCCGACCTGGAGTCGTTTCACGCGATGCGTTCCGCCTCCCGCGACCTTCTC 

34321 + + + + + + 34380 

ACCTGGCGAGGGCTGGACCTCAGCAAAGTGCGCTACGCAAGGCGGAGGGCGCTGGAAGAG 
4 WTAPDLESFHAMRSASRDLL 

GGAGGGCTGTAGTTCCCCGACGGTGTACTGCGGCCCCCGATCCGGGGGCCGCAGTACACC 

34381 + + + + + + 34440 

CCTCCCGACATCAAGGGGCTGCCACATGACGCCGGGGGCTAGGCCCCCGGCGTCATGTGG 
4-* G G L * - 

GTCGGGGCGGCTGGTGCTCAGCCGCGCAGGAATCCGATGAGCTCGGGGGCGAGCTTCTTG 

34441 + + + + + + 34500 

CAGCCCCGCCGACCACGAGTCGGCGCGTCCTTAGGCTACTCGAGCCCCCGCTCGAAGAAC 
22-* *GRLFGILEPALKK- 

GGCGCCATGGCGACGGCACCGTGGTTGAGCCCGTTCAGGGTGCGGTGGCTCGCGTCGGGG 

34501 + + + + + + 34560 

CCGCGGTACCGCTGCCGTGGCACCAACTCGGGCAAGTCCCACGCCACCGAGCGCAGCCCC 
22 PAMAVAGHNLGNLTRHSADP- 

AGGACTCCGGTGAGTTCCTTCGCGGCACGCTGGAAACCGTCGGGGCTCTTGGAACCGGTC 

34561 + + + + + + 34620 

TCCTGAGGCCACTCAAGGAAGCGCCGTGCGACCTTTGGCAGCCCCGAGAACCTTGGCCAG 
22 LVGTLEKAARQFGDPS KSGT 

AGCACCAGGGTCGGGGCCGACGCCGCCGACCACGGCTCGGCGGGGAGCGGCTTGCCCTGC 

34621 + + + + + + 34680 

TCGTGGTCCCAGCCCCGGCTGCGGCGGCTGGTGCCGAGCCGCCCCTCGCCGAACGGGACG 
22 LVLTPASAASWPEAPLPKGQ 

TGGGTGTCGCCCATCACCGCGATGTCGTAGGGAAGCGTGTTGGCCAGACCCTTGAGGTTG 

34681 + + + + + + 34740 

ACCCACAGCGGGTAGTGGCGCTACAGCATCCCTTCGCACAACCGGTCTGGGAACTCCAAC 
22 QTDGMVAIDYPLTNALGKLN- 

GACCAGACACCGGGCATCAGGCGCATGGCGCCGACCATGAAGGAGGGCATGCCCTGTGCC 

34741 + + + + + + 34800 

CTGGTCTGTGGCCCGTAGTCCGCGTACCGCGGCTGGTACTTCCTCCCGTACGGGACACGG 
22 SWVGPMLRMAGVMFS PMGQA- 

TTGACCATGAAGGCCTTGACCGCGTCGCTGCGTCGGTCCTCCGCCAGAAGGCTGTCGATC 

34801 + + + + + + 34860 

AACTGGTACTTCCGGAACTGGCGCAGCGACGCAGCCAGGAGGCGGTCTTCCGACAGCTAG 
22 KVMFAKVADSRRDEALLSDI- 

TGACCGCCGAAGCCGGCGGGCGGGCCGAAGCCGTCCGAGGTGACGGAGAACGGCGGCTCG 

34861 + + + + + + 34920 

ACTGGCGGCTTCGGCCGCCCGCCCGGCTTCGGCAGGCTCCACTGCCTCTTGCCGCCGAGC 
22 QGGFGAPPGFGDSTVSFPPE- 

TAGACCGCGAGCTTGTTCACCTTCAGGCCGGCGGCGGCGGCTCGCAGGGCGAGCACCGCG 

34921 + + + + + + 34980 

ATCTGGCGCTCGAACAAGTGGAAGTCCGGCCGCCGCCGCCGAGCGTCCCGCTCGTGGCGC 
22 YVALKNVKLGAAAARLALVA- 

CCGGAAGAGCTGCCGAACAGGGAGGCCGAACCGCCGACCTGGTCGATCAGCGCCGCGATG 

34981 + + + + + + 35040 

GGCCTTCTCGACGGCTTGTCCCTCCGGCTTGGCGGCTGGACCAGCTAGTCGCGGCGCTAC 
22 GSSSGFLSASGGVQDILAAI- 

TCCTCGATCTCGCGCTCGACCGCGTACGCCGGACCGTCGGCGCTGGCGCCGCGGCCCCGA 

35041 + + + + + + 35100 

AGGAGCTAGAGCGCGAGCTGGCGCATGCGGCCTGGCAGCCGCGACCGCGGCGCCGGGGCT 
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22 DEIEREVAYAP. GDASAGRGR- 

CGGTCGTAGTTGACGACCGTGAAGTGCTCGGCGAGGAGACCGGCGAGCTTCTTGGCGTCG 

35101 + + + + + + 35160 

GCCAGCATCAACTGCTGGCACTTCACGAGCCGCTCCTCTGGCCGCTCGAAGAACCGCAGC 
22 RDYNVVTFHEALLGALKKAD 

GAGCGGTCGGCCAAGGCGGAGGCCACCAGGATCACCGCCGGCCCCTCGCCCGACTTGTCG 

35161 + + + + + + 35220 

CTCGCCAGCCGGTTCCGCCTCCGGTGGTCCTAGTGGCGGCCGGGGAGCGGGCTGAACAGC 
22 SRDALASAVLIVAPGEGSKD- 

AAGGCGATCGTGGTGCCGTCGGCCGATACCGTCGTTGATTCCACCTTGGCTGCTTTCTCA 

35221 + + + + + + 35280 

TTCCGCTAGCACCACGGCAGCCGGCTATGGCAGCAACTAAGGTGGAACCGACGAAAGAGT 
22 FIATTGDASVTTSEVKAAKE 

CGGGTTGAAGACATAGCTTCCCTCAGATCACATTGTGGGGCGTGCTGCCGACAGTGGAGA 

35281 + + + + + - + 35340 

GCCCAACTTCTGTATCGAAGGGAGTCTAGTGTAACACCCCGCACGACGGCTGTCACCTCT 
22-< R T S S M - 

CCGGCGTCCGGAGGAAAAGTAATCGGTCCTGCCAGAATTGGGGGTTCCGGAGGGCACGCC 

35341 + + + + + + 35400 

GGCCGCAGGCCTCCTTTTCATTAGCCAGGACGGTCTTAACCCCCAAGGCCTCCCGTGCGG 

GACCGCTGCACGACGGCGCGCCCCGACCTTCCGGACATTGTCGTGCCCTCAGATGTGTTT 

35401 + + + + + + 35460 

CTGGCGACGTGCTGCCGCGCGGGGCTGGAAGGCCTGTAACAGCACGGGAGTCTACACAAA 

CGCATCTTCAGGAGTGCTCAGTGATCCGTGAGGTGAGAAAGGGACGGTGGTCCGGTCAGT 

35461 + + + + + + 35520 

GCGTAGAAGTCCTCACGAGTCACTAGGCACTCCACTCTTTCCCTGCCACCAGGCCAGTCA 
18-* * 

CGTTGCCGCGCGGGCTGTTCTGGTAAGCGGCCAGACGCCACTGCCCGTCCTGTTCGACGG 

35521 + + + + - + + 35580 

GCAACGGCGCGCCCGACAAGACCATTCGCCGGTCTGCGGTGACGGGCAGGACAAGCTGCC 
18 DNGRPSNQYAALRWQGDQEV 

CCAGCCAGGAGGCCCGGACGGCGCCGTCGCCGCTCGCCTCGGTCTCCCCCGGGGCGAGGA 

35581 + + + + + + 35640 

GGTCGGTCCTCCGGGCCTGCCGCGGCAGCGGCGAGCGGAGCCAGAGGGGGCCCCGCTCCT 
18 ALWSARVAGDGSAETEGPAL 

TGCCGCCCTCGGTGATGAGCAGGGCGATGCCGTCGCCGAGCAGGCGCGCGTCGATGGGGC 

35641 + + + + + + 35700 

ACGGCGGGAGCCACTACTCGTCCCGCTACGGCAGCGGCTCGTCCGCGCGCAGCTACCCCG 
18 IGGETI LLAIGDGLLRADI P 

TGCCGATGACACGGGTGCCCTTGTACGGGCCCGCGAAGGCGGCCGCCATGTGGGTGCGGA 

35701 + + + 4- + + 35760 

ACGGCTACTGTGCCCACGGGAACATGCCCGGGCGCTTCCGCCGGCGGTACACCCACGCCT 
18 SGIVRTGKYPGAFAAAMHTR 

TGTTCTCGCGGCCCTTGCGGAAGAGGCCGGGGAGGATCATCGTCCCGTCCTCGGCGAAGA 

35761 + + + + + + 35820 

ACAAGAGCGCCGGGAACGCCTTCTCCGGCCCCTCCTAGTAGCAGGGCAGGAGCCGCTTCT 
18 INERGKRFLGPLIMTGDEAF 

CGTCGGCGAACCGGTCGGCGTCGTGGTCGGCCCAGGCGGCCACGATGCGCGCCGGCAGAG 

35821 + + + + + + 35880 

GCAGCCGCTTGGCCAGCCGCAGCACCAGCCGGGTCCGCCGGTGCTACGCGCGGCCGTCTC 
18 VDAFRDADHDAWAAVI RAPL 

CGGCTACCGCTGCCAGGGCGGCGTCGGGAGCGGAGGTGGTCGAGTCGGTGCTGGTCATAT 
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35881 + + + + + + 35940 

GCCGATGGCGACGGTCCCGCCGCT^GCCCTCGCCTCCACCAGCTCAGCCACGACCAGTATA 
18-< AAVAALAADP ASTTSDTSTM 

CGCGGTTCCCGTCCGTTGGTTGGCGGTTTCGGCACGGCCCGCAGCCCTGCCCGAGCCCGA 

35941 + + + + + + 36000 

GCGCCAAGGGCAGGCAACCAACCGCCAAAGCCGTGCCGGGCGTCGGGACGGGCTCGGGCT 

CGCTGGCAGGCGGCCCCGTCATCAGGCATCTCCTGCGTTGCGCCCCACGCCAGTCACTTC 

36001 + + + + + + 36060 

GCGACCGTCCGCCGGGGCAGTAGTCCGTAGAGGACGCAACGCGGGGTGCGGTCAGTGAAG 

ACGGCCAGAACAAGTCGCGCATTCTGGAAGAAGCTGAGGCCCGCGACCCGGTGCGACGAT 

36061 + + + + + + 36120 

TGCCGGTCTTGTTCAGCGCGTAAGACCTTCTTCGACTCCGGGCGCTGGGCCACGCTGCTA 

CTGCGGTGTCACGGAGTTCGCACACGTTTACGCACGGAGGCTCGATGCCCGCTGTCAATG 

36121 + + + 4- + + 36180 

GACGCCACAGTGCCTCAAGCGTGTGCAAATGCGTGCCTCCGAGCTACGGGCGACAGTTAC 
5-> M P A V N G - 

GATCGGTGCAGTCAGGCCAGTCGCACCGACGCTCCGTCGTGGCGACGGTGGTGGGCAACT 

36181 + + + + + + 36240 

CTAGCCACGTCAGTCCGGTCAGCGTGGCTGCGAGGCAGCACCGCTGCCACCACCCGTTGA 
5 SVQSGQSHRRSVVATVVGNF- 

TCGTGGAGTCGTTCGACTGGCTCGCCTACGGGCTCTTCGCTCCTCTCTTCGCGGCTCAGT 

36241 + + + + + + 36300 

AGCACCTCAGCAAGCTGACCGAGCGGATGCCCGAGAAGCGAGGAGAGAAGCGCCGAGTCA 
5 VESFDWLAYGLFAPLFAAQF- 

TCTTCCCCTCGTCCAACCAGTTCACCTCCCTGCTCGGCGCGTTCGCGGTCTTCGGCACGG 

36301 + + + + + + 36360 

AGAAGGGGAGCAGGTTGGTCAAGTGGAGGGACGAGCCGCGCAAGCGCCAGAAGCCGTGCC 
5 FPSSNQFTSLLGAFAVFGTG- 

GCATGCTCTTCCGGCCGATCGGCGGGGTCCTGCTGGGCCGCCTCGCCGACCGGCGCGGCC 

36361 + + + + + + 36420 

CGTACGAGAAGGCCGGCTAGCCGCCCCAGGACGACCCGGCGGAGCGGCTGGCCGCGCCGG 
5 MLFRPIGGVLLGRLADRRGR- 

GGCGCCCCGCCCTGATGCTGGCGATCGGACTGATGACCGGCGGCTCGACCCTGATCGCCG 

36421 + + + + + + 36480 

CCGCGGGGCGGGACTACGACCGCTAGCCTGACTACTGGCCGCCGAGCTGGGACTAGCGGC 
5 RPALMLAI GLMTGGS TL I A V - 

TCGTCCCCACCTACGAGCACATCGGGATCCTCGCCCCGCTGCTTCTGCTGCTCGCCCGGC 

36481 + + + + + + 36540 

AGCAGGGGTGGATGCTCGTGTAGCCCTAGGAGCGGGGCGACGAAGACGACGAGCGGGCCG 
5 VPTYEHIGILAPLLLLLARL- 

TCGCCCAGGGAGTCTCCTCGGGCGGGGAATGGACAGCGGCGGCCACCTACCTGATGGAGA 

36541 + + + + + + 36600 

AGCGGGTCCCTCAGAGGAGCCCGCCCCTTACCTGTCGCCGCCGGTGGATGGACTACCTCT 
5 AQGVS SGGEWTAAATYLME I- 

TCGCGCCGAAGAACCGCCGGTGCCTCTACAGCAGCCTCTTCTCCGTGACGACCATGGCGG 

36601 + + + + + + 36660 

AGCGCGGCTTCTTGGCGGCCACGGAGATGTCGTCGGAGAAGAGGCACTGCTGGTACCGCC 
5 APKNRRCLYSSLFSVTTMAG- 

GCCCCTTCGTCGCATCGCTGCTGGGCGCGGGCCTCGGCGTGTGGCTGGGAACCGCGACGA 

36661 + + + + + + 36720 

CGGGGAAGCAGCGTAGCGACGACCCGCGCCCGGAGCCGCACACCGACCCTTGGCGCTGCT 
5 PFVAS LLGAGLGVWLGTATM- 
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TGGAGGCCTGGGGCTGGCGGGTGCCGTTCCTCCTCGGCGGCGTCTTCGGCGTGATCCTGC 

36721 + + + + + + 36780 

ACCTCCGGACCCCGACCGCCCACGGCAAGGAGGAGCCGCCGCAGAAGCCGCACTAGGACG 
5 EAWGWRVPFLLGGVFGVILL- 



TGTTCCTGCGCCGTCGGCTCACCGAGACCGAGGTCTTCCGCCGGGAGGTGCGGCCCCGGG 

36781 + + + + + + 36840 

ACAAGGACGCGGCAGCCGAGTGGCTCTGGCTCCAGAAGGCGGCCCTCCACGCCGGGGCCC 
5 FLRRRLTETEVFRREVR P R A - 

CCCGGCGCGGCTCACTGGGCCAGCTGATCGGAGCCCACCGCCCCCAGGTGCTGCTGGCCG 

36841 + + + + + + 36900 

GGGCCGCGCCGAGTGACCCGGTCGACTAGCCTCGGGTGGCGGGGGTCCACGACGACCGGC 
5 RRGSLGQL I GAHRPQVLLAV- 

TGATGCTGGTGGCCGGACTGGGCGTCATCGGCGGAACGTGGTCGACCGCGGTCCCGGCGA 

36901 + + + + + + 36960 

ACTACGACCACCGGCCTGACCCGCAGTAGCCGCCTTGCACCAGCTGGCGCCAGGGCCGCT 
5 MLVAGLGV I GGTWS TAV P A M - 

TGGGCCACCGTCTGATCGGCTCGCAGACGATGTTCTGGGTGGTGGTCTGTGTGACCGGCT 

36961 + + + + + + 37020 

ACCCGGTGGCAGACTAGCCGAGCGTCTGCTACAAGACCCACCACCAGACACACTGGCCGA 
5 GHRLI GS QTMFWVVVCVTGS- 

CGGTCATCCTGCTGCAGGTACCCATAGGGCTGCTCGCCGACCGGGTGGAACCGGGCAGGT 

37021 + + + + + + 37080 

GCCAGTAGGACGACGTCCATGGGTATCCCGACGAGCGGCTGGCCCACCTTGGCCCGTCCA 
5 VI LLQVP I GLLADRVE PGRF- 

TCCTGATCGTCTCCAGCGTCGTCTTCGCCGCTGTGGGCTCGTACGCCTACCTCACCGTCC 

37081 + + : + + + + 37140 

AGGACTAGCAGAGGTCGCAGCAGAAGCGGCGACACCCGAGCATGCGGATGGAGTGGCAGG 
5 LIVSSVVFAAVGSYAYLTVQ- 

AGGACTCCTTCGCGAGCCTGGCGTTCACGTACAGCACCGGAGTGATCTTCCTCGGCTGCG 

37141 + + + + + + 37200 

TCCTGAGGAAGCGCTCGGACCGCAAGTGCATGTCGTGGCCTCACTAGAAGGAGCCGACGC 
5 DS FASLAFTYSTGV I FLGCV- 

TCACCATGGTGCTGCCGAAGATGCTCTCCAGAATCTTCCCTCCGCAGATACGCGGCCTGG 

37201 + + + + + + 37260 

AGTGGTACCACGACGGCTTCTACGAGAGGTCTTAGAAGGGAGGCGTCTATGCGCCGGACC 
5 TMVLPKMLSRIFPPQIRGLG- 

GCATCGGGCTGCCGCACGCCTCGACCACCGCACTCCTCGGCGGGGCGGGGCCACTGCTGG 

37261 + + + + + + 37320 

CGTAGCCCGACGGCGTGCGGAGCTGGTGGCGTGAGGAGCCGCCCCGCCCCGGTGACGACC 
5 I GLPHASTTALLGGAGPLLA- 

CCGCCTACTCCGACGAGCGAGGCGCCTCGGGCTGGTTCATCGCCGCCGTGATGGCCGCGG 

37321 + + + + + + 37380 

GGCGGATGAGGCTGCTCGCTCCGCGGAGCCCGACCAAGTAGCGGCGGCACTACCGGCGCC 
5 AY SDE RGASGWF IAAVMAAV- 



TCCTGCTCGCCTGGCCGGCCACCCTGTGGGAGCGACGGCTGTTCCGCGCCCGGACGGCCC 

37381 + + + + + + 37440 

AGGACGAGCGGACCGGCCGGTGGGACACCCTCGCTGCCGACAAGGCGCGGGCCTGCCGGG 
5 LLAWPATLWERRLFRARTAP- 



CGGGAAGCGAGCCGGTTCCCGAATCCGCCGTCGCCCGCCCCGTCGGGTGACCGTCCGCAC 

37441 + + + + + + 37500 

GCCCTTCGCTCGGCCAAGGGCTTAGGCGGCAGCGGGCGGGGCAGCCCACTGGCAGGCGTG 
5-* GSEPVPESAVARPVG*- 
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TTCTGCATCCCGTCCGGCACCGAGCGCCGGCGACCTTCCCGACTGAGAGGTTGACATCAT 

37501 + + + + + + 37560 

AAGACGTAGGGCAGGCCGTGGCTCGCGGCCGCTGGAAGGGCTGACTCTCCAACTGTAGTA 
23-> M - 

GACGACGTCCGACACCACCGACCGGTCCCAGGACGGCGTGCCGCCGCTCTCCTTCCACCA 

37561 + + + - + + + 37620 

CTGCTGCAGGCTGTGGTGGCTGGCCAGGGTCCTGCCGCACGGCGGCGAGAGGAAGGTGGT 
23 TTSDTTDRSQDGVPPLSFHQ 

GGAGTTCCTGTGCATGTTCGACAGCGGGAACGACGGCGCCGACGTGGGGCCGTTCGGCCC 

37621 + + + + + + 37680 

CCTCAAGGACACGTACAAGCTGTCGCCCTTGCTGCCGCGGCTGCACCCCGGCAAGCCGGG 
23 EFLCMFDSGNDGADVGPFGP- 

CATGTACCACATCGTCGGAGCCTGGCGGCTGACCGGCGGGATCGACGAGGAGACCCTGCG 

37681 + + + + + + 37740 

GTACATGGTGTAGCAGCCTCGGACCGCCGACTGGCCGCCCTAGCTGCTCCTCTGGGACGC 
23 MYHIVGAWRLTGGIDEETLR- 

CGAGGCGCTGGGTGACGTCGTCGTGCGCCACGAGGCCCTGCGCACATCGCTGGTCCGCGA 

37741 + + + + + + 37800 

GCTCCGCGACCCACTGCAGCAGCACGCGGTGCTCCGGGACGCGTGTAGCGACCAGGCGCT 
23 EALGDVVVRHEALRTSLVRE 

% AGGTGGCACGCACCGGCCGGAGATCCTGCCTGCGGGGCCCGCCGCGCTGGAGGTCCGTGA 

^J: 37801 + + + + + + 37860 

4= TCCACCGTGCGTGGCCGGCCTCTAGGACGGACGCCCCGGGCGGCGCGACCTCCAGGCACT 
23 GGTHRPEI LPAGPAALEVRD 

LI TCTCGGCGACGTCGACGAGTCGGAGCGGGTGCGGCGCGGTGAGGAACTGCTCAACGAGGT 

k* s 37861 + + + + + + 37920 

J; AGAGCCGCTGCAGCTGCTCAGCCTCGCCCACGCCGCGCCACTCCTTGACGAGTTGCTCCA 

m 23 LGDVDESERVRRGEELLNEV- 

U GGAGTCGACCGGTCTGAGCGTGCGGGAGCTGCCCCTGCTGCGGGCCGTGCTCGGACGCTT 

L= 37921 + + + + + + 37980 

^ CCTCAGCTGGCCAGACTCGCACGCCCTCGACGGGGACGACGCCCGGCACGAGCCTGCGAA 

hJ 23 ESTGLSVRELPLLRAVLGRF 

G CGACCAGAAGGACGCGGTGCTGGTCCTCATCGCCCACCACACCGCCGCGGACGCCTGGGC 

p 37981 + + + + + + 38040 

GCTGGTCTTCCTGCGCCACGACCAGGAGTAGCGGGTGGTGTGGCGGCGCCTGCGGACCCG 
23 DQKDAVLVL IAHHTAADAWA 

CATGCACGTCATCGCCCGCGACCTGCTCAACCTGTACGCCGCCAGGCGCGGGAACCCGGT 

38041 + + + + + + 38100 

GTACGTGCAGTAGCGGGCGCTGGACGAGTTGGACATGCGGCGGTCCGCGCCCTTGGGCCA 
23 MHVIARDLLNLYAARRGNPV- 

TCCCCCGCTCCCCGAGCCGGCCCAGCATGCCGAGTTCGCCCGCTGGGAGCGCGAGGCGGC 

38101 + + + + + + 38160 

AGGGGGCGAGGGGCTCGGCCGGGTCGTACGGCTCAAGCGGGCGACCCTCGCGCTCCGCCG 
23 PPLPEPAQHAEFARWEREAA- 

CGAGGCACCGCGGGTCGCGGTCTCGAAGGAATTCTGGCGCAAGCGCCTCCAGGGCGCGCG 

38161 + + + + + + 38220 

GCTCCGTGGCGCCCAGCGCCAGAGCTTCCTTAAGACCGCGTTCGCGGAGGTCCCGCGCGC 
23 EAPRVAVSKEFWRKRLQGAR- 

GATCATCGGGCTGGAGACGGACATACCGCGCTCGGCGGGGCTGCCCAAGGGCACCGCGTG 

38221 + + + + + + 38280 

CTAGTAGCCCGACCTCTGCCTGTATGGCGCGAGCCGCCCCGACGGGTTCCCGTGGCGCAC 
23 I I GLETDI PRSAGLPKGTAW 

GCAGCGCTTCGCCGTACGCGGGGAACTGGCCGACGCCGTGGTGGAGTTCTCACGGGCCGC 
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38281 + + + + + + 38340 

CGTCGCGAAGCGGCATGCGCCCCTTGACCGGCTGCGGCACCACCTCAAGAGTGCCCGGCG 
23 QRFAVRGELADAVVE FSRAA 

CAAGTGCTCCCCGTTCATGACCATGTTCGCCGCCTACCAGGTGCTGCTGCACCGCAGGAC 

38341 + + + + + + 38400 

GTTCACGAGGGGCAAGTACTGGTACAAGCGGCGGATGGTCCACGACGACGTGGCGTCCTG 
23 KCS PFMTMFAAYQVLLHRRT 

GGGCGAGCTGGACATCACCGTGCCGACCTTCTCCGGGGGGCGCAACAACTCGCGGTTCGA 

38401 + + + + + + 38460 

CCCGCTCGACCTGTAGTGGCACGGCTGGAAGAGGCCCCCCGCGTTGTTGAGCGCCAAGCT 
23 GELDITVPTFSGGRNNSRFE 



GGACACCGTCGGTTCCTTCATCAACTTCCTGCCGCTGCGTACCGACCTCTCCGGATGCGC 

38461 -f + + + + + 38520 

CCTGTGGCAGCCAAGGAAGTAGTTGAAGGACGGCGACGCATGGCTGGAGAGGCCTACGCG 
23 DTVGSFINFLPLRTDLSGCA- 



ATCCTTCCGCGAGGTCGTGCTGCGCACCCGCACCACCTGCGGAGAGGCGTTCACCCACGA 

38521 + + + + + + 38580 

TAGGAAGGCGCTCCAGCACGACGCGTGGGCGTGGTGGACGCCTCTCCGCAAGTGGGTGCT 
23 SFREVVLRTRTTCGEAFTHE- 



GCTGCCCTTCTCCCGGCTGATCCCGGAGGTGCCGGAGCTGATGGCGTCGGCGGCCTCCGA 

38581 + + + + + + 38640 

CGACGGGAAGAGGGCCGACTAGGGCCTCCACGGCCTCGACTACCGCAGCCGCCGGAGGCT 
23 LPFSRLI PEVPELMASAASD- 



CAACCACCAGATCTCCGTCTTCCAGGCCGTGCACGCGCCCGCGTCCGAGGGGCCCGAGCA 

38641 + + + + + + 38700 

GTTGGTGGTCTAGAGGCAGAAGGTCCGGCACGTGCGCGGGCGCAGGCTCCCCGGGCTCGT 
23 NHQI SVFQAVHAPASEGPEQ 

GGCCGGGGACCTGACGTACTCGAAGATCTGGGAGCGGCAGCTGTCGCAGGCGGAGGGCTC 

38701 + + + + + + 38760 

CCGGCCCCTGGACTGCATGAGCTTCTAGACCCTCGCCGTCGACAGCGTCCGCCTCCCGAG 
23 AGDLTYS KIWERQLSQAEGS 

CGACATCCCCGACGGGGTGCTGTGGTCGATCCACATCGACCCCTCGGGCTCCATGGCCGG 

38761 + + + + + + 38820 

GCTGTAGGGGCTGCCCCACGACACCAGCTAGGTGTAGCTGGGGAGCCCGAGGTACCGGCC 
23 DIPDGVLWSIHIDPSGSMAG- 

CAGCCTCGGGTACAACACCAACCGCTTCAAGGACGAGACGATGGCGGCCTTCCTGGCCGA 

38821 + + + + + + 38880 

GTCGGAGCCCATGTTGTGGTTGGCGAAGTTCCTGCTCTGCTACCGCCGGAAGGACCGGCT 
23 SLGYNTNRFKDETMAAFLA D- 



CTACCTCGACGTGCTCGAGAACGCGGTGGCCCGGCCGGACGCCCCCTTCACCTCCTGAGA 

38881 + + + + + + 38940 

GATGGAGCTGCACGAGCTCTTGCGCCACCGGGCCGGCCTGCGGGGGAAGTGGAGGACTCT 
23-* YLDVLENAVARPDAPFTS * - 



CAGTTCCGGCGGCGGCGAACCCGCCCGAAGAAAGGAAAGCCAGTGTCCACCGTTTCCGAC 

38941 + + + -- + + + 39000 

GTCAAGGCCGCCGCCGCTTGGGCGGGCTTCTTTCCTTTCGGTCACAGGTGGCAAAGGCTG 
26-> M S T V S D 



ACAGCGGCCGGCTCCTCCCTGGAGGAGAAGGTCACCCGGATCTGGACGGGTGTTCTCGGC 

39001 + + + + + + 39060 

TGTCGCCGGCCGAGGAGGGACCTCCTCTTCCAGTGGGCCTAGACCTGCCCACAAGAGCCG 
26 TAAGSSLEEKVTRIWTGVLG 

ACGTCCGGTGAGGAAGGCGCGACGTTCATCGAGCTCGGAGGGCAGTCGGTCTCGGCCGTG 
39061 + + + + + + 39120 
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TGCAGGCCACTCCTTCCGCGCTGCAAGTAGCTCGAGCCTCCCGTCAGCCAGAGCCGGCAC 
26 TSGEEGATFI ELGGQSVSAV 

CGCATCGCCACGCGTATCCAGGAGGAGCTCGACATCTGGGTCGACATCGGCGTCCTCTTC 

39121 + + + + + + 39180 

GCGTAGCGGTGCGCATAGGTCCTCCTCGAGCTGTAGACCCAGCTGTAGCCGCAGGAGAAG 
26 RIATRIQEELDIWVDIGVLF 

GACGACCCGGATCTGCCTACCTTCATCGCGGCGGTCGTCCGGACGGCCGACGCCGCGGGC 

39181 + + + + + + 39240 

CTGCTGGGCCTAGACGGATGGAAGTAGCGCCGCCAGCAGGCCTGCCGGCTGCGGCGCCCG 
26 DDPDLPTFIAAVVRTADAAG 

GGCGAGGGCTCCGGAACGCAGTGAGACTCGCCGGGCGCCGTCTCCCCGCGGCGCCCGGTT 

39241 + + + + + + 39300 

CCGCTCCCGAGGCCTTGCGTCACTCTGAGCGGCCCGCGGCAGAGGGGCGCCGCGGGCCAA 
26-* GEGSGTQ*- 

TCACATGGCTGAGGCGGTTCACCCGGTACCGGGTGAACCGCCTCAGCCATGTGAAACCGG 

39301 + + + + + + 39360 

AGTGTACCGACTCCGCCAAGTGGGCCATGGCCCACTTGGCGGAGTCGGTACACTTTGGCC 

GCCTGGTCAGCGCAGCTGGATGTCCGTCTCCCGGGCGATCGCCCGGAGGAACTCGCCGCG 

39361 + + + + + + 39420 

CGGACCAGTCGCGTCGACCTACAGGCAGAGGGCCCGCTAGCGGGCCTCCTTGAGCGGCGC 
24-* *RLQIDTERAIARLFEGR- 

GGACAGCGCGTCGGCGACCAGCTCGATGTCGTCGGCCATGTACCGGTCGACGCCCAGCGT 

39421 + + + + + + 39480 

CCTGTCGCGCAGCCGCTGGTCGAGCTACAGCAGCCGGTACATGGCCAGCTGCGGGTCGCA 
24 SLADAVLE I DDAMYRDVGLT- 

CGGAACCAGCCGGCGCACCGCTTCGTACGTGGCCTTCGCCGCCGGGCTCAAGCCGTCGAA 

39481 + + + + + + 39540 

GCCTTGGTCGGCCGCGTGGCGAAGCATGCACCGGAAGCGGCGGCCCGAGTTCGGCAGCTT 
24 PVLRRVAEYTAKAAPSL GDF- 

CCGGCCGGAGATGTCGACCGCCTGGGCGGCGGCCAGGTACTCCACCGCGAGGATCTTGTT 

39541 + + + + + + 39600 

GGCCGGCCTCTACAGCTGGCGGACCCGCCGCCGGTCCATGAGGTGGCGCTCCTAGAACAA 
24 RGS I DVAQ AAALYEVAL I KN- 

GTTGTTCGACAGGACCCGGCGGGCGTTGCGGGCCGAGATCAGGCCCATGCTCACCACGTC 

39601 + + + + + + 39660 

CAACAAGCTGTCCTGGGCCGCCCGCAACGCCCGGCTCTAGTCCGGGTACGAGTGGTGCAG 
24 NNSLVRRANRASILGMSVVD- 

CTGGTTGTCGCCGTTGGACGGGACGCTCTGGGTGCTGGCCGGGCCGATCGTCCGGTTCTC 

39661 + + + + + + 39720 

GACCAACAGCGGCAACCTGCCCTGCGAGACCCACGACCGGCCCGGCTAGCAGGCCAAGAG 

24 Q NDGNS PVSQTSAPGI TRNE- 

GGCCACCAGTGCGGTGGCCGGGTACTGGGCGCCGGCGAATCCGCTGTGCAGCCCCGGGTC 

39721 + + + + + + 39780 

CCGGTGGTCACGCCACCGGCCCATGACCCGCGGCCGCTTAGGCGACACGTCGGGGCCCAG 
24 AVLATAPYQAGAFGSHLGPD- 

CCCGGAGACGAGGAACTCCGGGAGGCCGTAGCTGAGGTGCCGGTTCAGGACCCGGTTGAT 

39781 + + + + + + 39840 

GGGCCTCTGCTCCTTGAGGCCCTCCGGCATCGACTCCACGGCCAAGTCCTGGGCCAACTA 
24 GSVLFEPLGYSLHRNLVRNI- 

CTGCCGCTCGGCCAGGACGCCGAGCTGGGTGAGCGCGATGGTCACGAAGTCCATCGCGAA 

39841 + + + + + + 39900 

GACGGCGAGCCGGTCCTGCGGCTCGACCCACTCGCGCTACCAGTGCTTCAGGTAGCGCTT 
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24 QR EALVGLQTLAI TVFDMAF- 

CGCGATCGGCTGACCGTGGAAGTTCGCCCCGTGGAAGATCTCCTTGCCCTCGAAGAAGAG 

39901 + + + + + + 39960 

GCGCTAGCCGACTGGCACCTTCAAGCGGGGCACCTTCTAGAGGAACGGGAGCTTCTTCTC 
24 AI PQGHFNAGHF I EKGEFFL- 

CGGGTTGTCGTTGGCCGAGTTGAGCTCGATGCGCAGCTTGTGCCGCGCGTGGTACAAGGT 

39961 + + + + + + 40020 

GCCCAACAGCAACCGGCTCAACTCGAGCTACGCGTCGAACACGGCGCGCACCATGTTCCA 
24 PNDNASNLE IRLKHRAHYLT- 

GTCGCGCACCGCCCCGACGACCTGGGGGATGGCCCGCAGCGAGTAGGCCTTCTGCAGGTA 

40021 + + + + + + 40080 

CAGCGCGTGGCGGGGCTGCTGGACCCCCTACCGGGCGTCGCTCATCCGGAAGACGTCCAT 
24 DRVAGVVQ P IARLS YAKQLY- 

GATCTCCGAGCGCTGGACGTCCTTGCCGGCCTCCTTGTCCTTCTGGAGTTCTCGGCGCAG 

40081 + + - + + + + 40140 

CTAGAGGCTCGCGACCTGCAGGAACGGCCGGAGGAACAGGAAGACCTCAAGAGCCGCGTC 
24 I ESRQVDKGAEKDKQLERRL- 

GTCGGCGTGCTCGACCGTCAGTCCGCTGCCCCGCATCAGGGCCCGCATGTTGGCGGCGGT 

40141 + + + + + + 40200 

CAGCCGCACGAGCTGGCAGTCAGGCGACGGGGCGTAGTCCCGGGCGTACAACCGCCGCCA 
24 DAHEVTLGSGRMLARMNAAT- 

GTCGATCTGGCCCTCGTGCGGGCGGGCTATGTCGTGCCCCTCCGCGAGGAAGGGGCTGGT 

40201 + + + + + + 40260 

CAGCTAGACCGGGAGCACGCCCGCCCGATACAGCACGGGGAGGCGCTCCTTCCCCGACCA 
24 DIQGEHPRAIDHGEALFPST- 

CGATCCGCGTACCGCCTCGATGAGCAGAGCCGTCACGATCTCGGCCTGCTGGGCCTGCTC 

40261 + + + + + + 40320 

GCTAGGCGCATGGCGGAGCTACTCGTCTCGGCAGTGCTAGAGCCGGACGACCCGGACGAG 
24 SGRVAE I LLATVI EAQQAQE- 

CAGGGCCCGTCCGACGACCAGGGAGCCCAGACCGGTCATCCCGGACGTGCCGTTGATCAG 

40321 + + -- + + + + 40380 

GTCCCGGGCAGGCTGCTGGTCCCTCGGGTCTGGCCAGTAGGGCCTGCACGGCT^ACTAGTC 
24 LARGVVLSGLGTMGSTGN I L- 

TGCGAGGCCCTCCTTGAAGCGCAGTTCGAGCGGCTCGATGCCCCGCTCGGCCAGCACCTG 

40381 + -f + + + + 40440 

ACGCTCCGGGAGGAACTTCGCGTCAAGCTCGCCGAGCTACGGGGCGAGCCGGTCGTGGAC 
24 ALGEKFRLELPE I GREALVQ- 

GGCGGTCTCCACCGGCCGTCCGTCGCGCAGGACGTAGCCCTCTCCGATGAGGGTGCTCGC 

40441 + + + + + + 40500 

CCGCCAGAGGTGGCCGGCAGGCAGCGCGTCCTGCATCGGGAGAGGCTACTCCCACGAGCG 
24 ATEVPRGDRLVYGEGILTSA- 

GACGTGGGAGAGGGGAGCCAGGTCGCCGCTCGCCCCGAGTGACCCGATCTCGGGTATGGC 

40501 + + + + + + 40560 

CTGCACCCTCTCCCCTCGGTCCAGCGGCGAGCGGGGCTCACTGGGCTAGAGCCCATACCG 
24 VHSLPALDGSAGLSGIEPIA- 

CGGGGTGATGCCCTCGTTCAGGTACTGCGCGAGGCGTTCGAGGATGATGGGGCGCACCGC 

40561 + + + + + + 40620 

GCCCCACTACGGGAGCAAGTCCATGACGCGCTCCGCAAGCTCCTACTACCCCGCGTGGCG 

24 PT IGENLYQALREL I I PRVA- 

GGAGTGGCCCTTGGCGAGGGTGTTCAGCCGGGCGGCGACGATCGCCCGCGCCTCGTCCTC 

40621 + + + + + + 40680 

CCTCACCGGGAACCGCTCCCACAAGTCGGCCCGCCGCTGCTAGCGGGCGCGGAGCAGGAG 
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24 S HGKALTNL RAAV I A R A E D E - 

GGCGAACAGCGGACCGACTCCCGCGCTGTGGCTACGGACGAGATTGGTCTGCAGTTCGAC 

40681 + + + + + + 40740 

CCGCTTGTCGCCTGGCTGAGGGCGCGACACCGATGCCTGCTCTAACCAGACGTCAAGCTG 
24 AFLPGVGASHSRVLNTQLEV- 

TTCCTTCGACTTGTCGACCTGCATGTAGATCATCTCGCCGTACCCGGTGGTCACCCCGTA 

40741 + + + + + + 40800 

AAGGAAGCTGAACAGCTGGACGTACATCTAGTAGAGCGGCATGGGCCACCAGTGGGGCAT 
24 E KS KDVQMY IMEGYGTTVGY- 

GATGGGGATGTTCTGTTCGGCGATCCCTTCGAAGATCTCCCGGCTCTTCTGGGCCTTCGC 

40801 + + + + -f + 40860 

CTACCCCTACAAGACAAGCCGCTAGGGAAGCTTCTAGAGGGCCGAGAAGACCCGGAAGCG 
24 I PINQEAIGEFI ERSKQAKA- 

GATGGATTCGGCCGGTACGTCGACCGTCGCGCGTTCCTCCGCGACGCGGCGTACGGCTTC 

40861 + + + + + + 40920 

CTACCTAAGCCGGCCATGCAGCTGGCAGCGCGCAAGGAGGCGCTGCGCCGCATGCCGAAG 
24 ISEAPVDVTAREEAVRRVAE- 

GACGGTCAGGGTCTCGCCGTCGACGGAAACCGGGACGATCTCGGTCTCGACTTGAGTCAA 

40921 + + + + + + 40980 

CTGCCAGTCCCAGAGCGGCAGCTGCCTTTGGCCCTGCTAGAGCCAGAGCTGAACTCAGTT 

24 VTLTEGDVSVPVI ETEVQTL- 

TGCCATCACTCCATGGGTAGCGGCCGAGGCCGGTGTACGACAGGTCAGGGGGTGGGTTCG 

40981 + + + + + + 41040 

ACGGTAGTGAGGTACCCATCGCCGGCTCCGGCCACATGCTGTCCAGTCCCCCACCCAAGC 

24- < A M - 

TGAGGCGCGGCTCAGCGGGTGAGCCGGGAGCGGTCCACCTTCCCCGCGGCGTTGCGCGGC 

41041 + + + + + + 41100 

ACTCCGCGCCGAGTCGCCCACTCGGCCCTCGCCAGGTGGAAGGGGCGCCGCAACGCGCCG 

25- * *RTLRSRDVKGAANRP 

AGGCGTGAAGTCAGGCGGGTGAAGACGGCGGGCAGTGCGAGGGGGCCGAACTGGCCGCGC 

41101 + + + + + + 41160 

TCCGCACTTCAGTCCGCCCACTTCTGCCGCCCGTCACGCTCCCCCGGCTTGACCGGCGCG 

25 LRSTLRTFVAPLALPGFQGR- 

AGATGGGAACGCCAGGCCCGGATGTCCGCGCGCACGTCCTCCCGGCCCTCTCCTTGTGGC 

41161 + + + + + + 41220 

TCTACCCTTGCGGTCCGGGCCTACAGGCGCGCGTGCAGGAGGGCCGGGAGAGGAACACCG 
25 L HSRWARIDARVDERGEGQP 

ACCACGTACACGGCGAGGCGGGTCACCAGGCCCTGGCCGTTGACGTGGGGGAGGACCGCG 

41221 + + + + + + 41280 

TGGTGCATGTGCCGCTCCGCCCAGTGGTCCGGGACCGGCAACTGCACCCCCTCCTGGCGC 
25 VVYVALRTVLGQGNVHPLVA- 

CACTCCAGGACCGAGGGGTCACGGTTCAGCGCGGCCTCGATCTCGGTGAGTTCCAAGCGG 

41281 + + + + + + 41340 

GTGAGGTCCTGGCTCCCCAGTGCCAAGTCGCGCCGGAGCTAGAGCCACTCAAGGTTCGCC 
25 CELVSPDRNLAAEIETLELR- 

TTCCCGAACAGCTTGACCTGGAAGTCCTTGCGGCCCCGGAATTCCAGGGCTCCGTCGAAC 

41341 + + + + + + 41400 

AAGGGCTTGTCGAACTGGACCTTCAGGAACGCCGGGGCCTTAAGGTCCCGAGGCAGCTTG 
25 NGFLKVQFDKRGRFEL AGDF 

CGTACCCGCGCCAGATCCCCGGTCCGGTACCACCGGTCACCGTCCGGGGCGAGGCCGGCG 

41401 + + + + + + 41460 

GCATGGGCGCGGTCTAGGGGCCAGGCCATGGTGGCCAGTGGCAGGCCCCGCTCCGGCCGC 
25 RVRALDGTRYWRDGDPALGA- 
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AGGGGCGCGAACAGCGCGCTGTGGTCCGGGCCGCCCTCGACGGCGAGATAACCCGGCGTC 

41461 + + + + + + 41520 

TCCCCGCGCTTGTCGCGCGACACCAGGCCCGGCGGGAGCTGCCGCTCTATTGGGCCGCAG 
25 LPAFLASHDPGGEVALYGPT 

ACGTACGGGGAGCGGATCACCAGTTCGCCGGTGACGCCGGCGGGGCTCGGCCGGTCGTCC 

41521 + + + + + + 41580 

TGCATGCCCCTCGCCTAGTGGTCAAGCGGCCACTGCGGCCGCCCCGAGCCGGCCAGCAGG 
25 VYPSRIVLEGTVGAPS PRDD 

GCGTCCACGACGAGTACCTGGCGGCCGGGGAGCGGGTACCCGATCGGGGCCGGGCCCGTG 

41581 + + -- + + + + 41640 

CGCAGGTGCTGCTCATGGACCGCCGGCCCCTCGCCCATGGGCTAGCCCCGGCCCGGGCAC 
25 ADVVLVQRGPLPYGI PAPGT 

ACCGGCCCGGTGATCTCGTGCCAGGTCGCGGCGATCGTCTCGGTGGGCCCGTAGAGGTTG 

41641 + + -- + + + + 41700 

TGGCCGGGCCACTAGAGCACGGTCCAGCGCCGCTAGCAGAGCCACCCGGGCATCTCCAAC 
25 VPGTIEHWTAAITETPGYLN- 

ATCAGGCGGGTCCGGGGCAGGGCCGCGCGCAGTCCGTCCACGAGTTCGCCGGGCAGCGCC 

41701 + ■ + + + + + 41760 

TAGTCCGCCCAGGCCCCGTCCCGGCGCGCGTCAGGCAGGTGCTCAAGCGGCCCGTCGCGG 
25 ILRTRPLAARLGDVLEGPLA- 

TCGCCCATCAGGAGCAGGTGGCCCAGGGTGCCGGGCCGATCGCCCGGGTCGGAGGCGGTG 

41761 + + + + + + 41820 

AGCGGGTAGTCCTCGTCCACCGGGTCCCACGGCCCGGCTAGCGGGCCCAGCCTCCGCCAC 
25 EGMLLLHGLTGPRDGPDSAT- 

ATCACTCCCAGGAGGTCCCGGGCGAAGCTGGGCACGGTCTGGAGATGAGTGATCCGCTCC 

41821 + + + + + + 41880 

TAGTGAGGGTCCTCCAGGGCCCGCTTCGACCCGTGCCAGACCTCTACTCACTAGGCGAGG 
25 IVGLLDRAFSPVTQLHTIRE 

TGGACGAGCCACGGCACCAGCTTGTCGGGGTTCACCCTGACGCGCTCCGGCACCGGACAC 

41881 + + + + + + 41940 

ACCTGCTCGGTGCCGTGGTCGAACAGCCCCT^AGTGGGACTGCGCGAGGCCGTGGCCTGTG 

25 QVLWPVLKDPNVRVREPVPC- 

AGCGTCCCGCCGGCCACGAGCGTCGCGAAGACCTCGGCCAGCGCCGGGTCGTGCTCCGGG 

41941 + + + + + + 42000 

TCGCAGGGCGGCCGGTGCTCGCAGCGCTTCTGGAGCCGGTCGCGGCCCAGCACGAGGCCC 
25 LTGGAVLTAF VEALAPDHEP- 



SEQ ID No. 2. C-1027 gene cluster DNA sequence from 41,980 to 63,164 

AGCGCCGGGTCGTGCTCCGGGGAGACCCACTGCGCCACCCGCGCGCCCGGCCCCATCGCG 

41980 + + + + - + + 42039 

TCGCGGCCCAGCACGAGGCCCCTCTGGGTGACGCGGTGGGCGCGCGGGCCGGGGTAGCGC 

25 LAPDHEPSVWQAVRAGPGMA- 

AACCGTTCGCCCATCCAGCCCGCGAACTGGCCCAGCGCGGCATGCGACTGGGCGATCCCC 

42040 + + + + + + 42099 

TTGGCAAGCGGGTAGGTCGGGCGCTTGACCGGGTCGCGCCGTACGCTGACCCGCTAGGGG 
25 FREGMWGAFQGLAAHSQAIG- 

TTGGGCCGCCCGGTCGAACCCGAGGTGAACGCCACGTAGGCCAGGTCTGCCAGGCCCGGC 

42100 + + + + + + 42159 

AACCCGGCGGGCCAGCTTGGGCTCCACTTGCGGTGCATCCGGTCCAGACGGTCCGGGCCG 
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25 KPRGTSGSTFAVYALDALGP- 

CCCGCCGCGGTCGTCGCGTCCGGGCCGGCGGCGGGTCGAGGGCCGAGCACAGAGGAGGCG 

42160 + + + + + + 42219 

GGGCGGCGCCAGCAGCGCAGGCCCGGCCGCCGCCCAGCTCCCGGCTCGTGTCTCCTCCGC 
25 GAATTADPGAAPRPGLVSSA- 

TCCAGCAGGGTGGCGCCCGGTTCACCGGCGTACCAGAGCGCCAGCGGATCCTCCTGCGGA 

42220 + + + + + + 42279 

AGGTCGTCCCACCGCGGGCCAAGTGGCCGCATGGTCTCGCGGTCGCCTAGGAGGACGCCT 
25 DLLTAGPEGAYWLALPDEQP- 

TCGCCGTCGAGGACCAGGCACGCCGGGCGCAGATCGCTGAGCATCGACCGGTGTCGTTCG 

42280 + + + + + + 42339 

AGCGGCAGCTCCTGGTCCGTGCGGCCCGCGTCTAGCGACTCGTAGCTGGCCACAGCAAGC 
25 DGDLVLCAPRLDSLMSRHRE 

CCCGCGCCGTCCGGAGCGAACCACGCCAGGTGGGCGCCCGCCTCCAGGACTCCCAGCAGC 

42340 + + + + + + 42399 

GGGCGCGGCAGGCCTCGCTTGGTGCGGTCCACCCGCGGGCGGAGGTCCTGAGGGTCGTCG 
25 GAGDPAFWALHAGAELVGLL- 

ACCGCGATCCGGCGGGCGCCCGGCTGCATCCGCACCGCCACCGGCGAGCCGTGCCCCGCG 

42400 + + + + + + 42459 

TGGCGCTAGGCCGCCCGCGGGCCGACGTAGGCGTGGCGGTGGCCGCTCGGCACGGGGCGC 
25 VAI RRAGPQMRVAVPSGHGA- 

CCGGCCGCGGTGAGGGCCGAGGCGACGCGGGCCGCGTCCGCGGTCAGTTCGGCGGTCAGT 

42460 + + - + + + + 42519 

GGCCGGCGCCACTCCCGGCTCCGCTGCGCCCGGCGCAGGCGCCAGTCAAGGCGCCAGTCA 
25 GAATLASAVRAADATLEATL 

TCGGCGTAGCTTGTGCGCGTGCCGCCGAACGAGACGGCGACACCGTCGTGTTCCGCGTGG 

42520 + + + + + + 42579 

AGCCGCATCGAACACGCGCACGGCGGCTTGCTCTGCCGCTGTGGCAGCACAAGGCGCACC 
25 EAYSTRTGGFSVAVGDHEAH 

CGGCGGACCGAGGCGTGCACCGGCCGCGTCATGTCCCCGCCGGACGCCCGGCGGTCCGAA 

42580 + + + + + + 42639 

GCCGCCTGGCTCCGCACGTGGCCGGCGCAGTACAGGGGCGGCCTGCGGGCCGCCAGGCTT 
25 RRVSAHVPRTM (ORF25) 

GCGCGCAGGGCGTGGTCCCGGTGGCGGTCGTCGTCCAGCGGCAGAGCGCCCACGGGTGTG 

42640 + + + + + + 42699 

CGCGCGTCCCGCACCAGGGCCACCGCCAGCAGCAGGTCGCCGTCTCGCGGGTGCCCACAC 

TCCGGATCCGTGGTCGCGGCGGTCAGGAGGACGGCCAGCTGATCCAGCATCCGCCGGGCC 

42700 + + + + + + 42759 

AGGCCTAGGCACCAGCGCCGCCAGTCCTCCTGCCGGTCGACTAGGTCGTAGGCGGCCCGG 

GAAGCGGGCTCGAACAGAGCTTCGCGGTACTCCAGGTAGCCGGTGACCGAGGGCGCGGTG 

42760 + + + + + + 42819 

CTTCGCCCGAGCTTGTCTCGAAGCGCCATGAGGTCCATCGGCCACTGGCTCCCGCGCCAC 

TCCTGCAGCACCAGGGTCAGGTCGGCGGCGGCAGTGCCGTTGTGCACGGACAGCCGCCTC 

42820 + + + + + + 42879 

AGGACGTCGTGGTCCCAGTCCAGCCGCCGCCGTCACGGCAACACGTGCCTGTCGGCGGAG 

ACCTCGGCGCCTGGTATCCGCAGGCCCGGCCGCTCCTCGTGGACGAACACGGCGTCGGCC 

42880 + + + + + + 42939 

TGGAGCCGCGGACCATAGGCGTCCGGGCCGGCGAGGAGCACCTGCTTGTGCCGCAGCCGG 

CCCTCGATCCGGCACGGCCCGGGGGCCGGGGCCGGCGTCGTGTGCAGCAGCTCCCGGAAG 

42940 + + + + + + 42999 

GGGAGCTAGGCCGTGCCGGGCCCCCGGCCCCGGCCGCAGCACACGTCGTCGAGGGCCTTC 
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GCGGTGGCCGGCGTGCCGTCGTCCTGTCCGGCGTAGCGCTGGACCAGGGCTCGGAATCCG 



43000 



+ 



43059 



CGCCACCGGCCGCACGGCAGCAGGACAGGCCGCATCGCGACCTGGTCCCGAGCCTTAGGC 

GCCAGCACCACGGCCGCGGCGGTGACCCCTTCCGCTTCGGCGAGCCGGGCCGTACGGAAG 

43060 + + + + + + 43119 

CGGTCGTGGTGCCGGCGCCGCCACTGGGGAAGGCGAAGCCGCTCGGCCCGGCATGCCTTC 

CCGAGGTCCGGACTCCAGCCGAAGGCGACGGTGCTCCCCGCGTGCGAGGGCAGGTGCGGG 

43120 + + + + + + 43179 

GGCTCCAGGCCTGAGGTCGGCTTCCGCTGCCACGAGGGGCGCACGCTCCCGTCCACGCCC 

CGGTTCCGGTCGGCGGGCAGGACCTGTCCGGAGGCGGTCGCCGAAGACTCCTCGCTCCCG 

43180 + + + + + + 43239 

GCCAAGGCCAGCCGCCCGTCCTGGACAGGCCTCCGCCAGCGGCTTCTGAGGAGCGAGGGC 

GGCGCCCGGGGCGTTTGCGGCGCGGGCGCAGTGGGAGGCCGGCCGCCGGTGGTGACGGCG 

43240 + + + + + + 43299 

CCGCGGGCCCCGCAAACGCCGCGCCCGCGTCACCCTCCGGCCGGCGGCCACCACTGCCGC 

AGGTACGCGTTCGACAACGCGGCCGGCAGGGGCCCGGACGGCCCGTCCCAGGCTCCGGAG 

43300 + + + + + + 43359 

TCCATGCGCAAGCTGTTGCGCCGGCCGTCCCCGGGCCTGCCGGGCAGGGTCCGAGGCCTC 

TGCGAGGCCACCAGGAGAAGCAGGTGCGCGCGTGGGCCTCTGCGGGCGATGTGGAGCCGT 

43360 + + + + + + 43419 

ACGCTCCGGTGGTCCTCTTCGTCCACGCGCGCACCCGGAGACGCCCGCTACACCTCGGCA 

GCGGGCGCGTCACCCTCGGCGAAGGGACGGGCCGCCCAGCGAGCGCAGAGTTCCTCCTCC 

43420 + + + + + + 43479 

CGCCCGCGCAGTGGGAGCCGCTTCCCTGCCCGGCGGGTCGCTCGCGTCTCAAGGAGGAGG 

CCGCACTCCTCGTCGGCACTCGGCCCGTCCACGGCGGCCCCGTCTCCGGCGGCGGCCCGC 

43480 + + + + + + 43539 

GGCGTGAGGAGCAGCCGTGAGCCGGGCAGGTGCCGCCGGGGCAGAGGCCGCCGCCGGGCG 

CAGGCCGTCCGCAGGGCCTCCAGGTCGAGTCCGCCGCTCACGTGGTAGGCCGCGTACGGG 

43540 + + + + + + 43599 

GTCCGGCAGGCGTCCCGGAGGTCCAGCTCAGGCGGCGAGTGCACCATCCGGCGCATGCCC 

TGCAACACCGCAGATCCGGAGGCCGGCGAAGGCCCCCGGTCCGGCTCGGTCACAGTCACG 

43600 + + + + + + 43659 

ACGTTGTGGCGTCTAGGCCTCCGGCCGCTTCCGGGGGCCAGGCCGAGCCAGTGTCAGTGC 

TCATTCGCCACGACGCCCATCTTGGGGCGGCGGCGCACAGGACGCTTCTCCTTGAGTGCG 

43660 + + + + + + 43719 

AGTAAGCGGTGCTGCGGGTAGAACCCCGCCGCCGCGTGTCCTGCGAAGAGGAACTCACGC 

GAGCTCCGCGTACGGCGCCGAAGCGTTCGGTCAAACCTTGTTCGACCAACTGCGCAATCT 

43720 + + + + + + 43779 

CTCGAGGCGCATGCCGCGGCTTCGCAAGCCAGTTTGGAACAAGCTGGTTGACGCGTTAGA 

GGAAGTTGACGTCTTCCAGGTGGAGTTGGGAACGATGGAGGCCCCCGCCGGCCGCGTCGG 

43780 + + + + + + 43839 

CCTTCAACTGCAGAAGGTCCACCTCAACCCTTGCTACCTCCGGGGGCGGCCGGCGCAGCC 

AACGGCCGTGCAGTGCGGCCCTCTCCAACACTCCCGGCCATCGCGGAATCCGAGACGTGC 

43840 + + + + + + 43899 

TTGCCGGCACGTCACGCCGGGAGAGGTTGTGAGGGCCGGTAGCGCCTTAGGCTCTGCACG 

CCGAAGGAGCCCCCCTTGCAAGCCTGGTTCAAGCGCACCAGTGGTGTGCCCGGTGACAGA 

43900 + + + + + + 43959 

GGCTTCCTCGGGGGGAACGTTCGGACCAAGTTCGCGTGGTCACCACACGGGCCACTGTCT 
27 (ORF27) V P G D R 
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CGTGGAAAGTGGCTGGTCCTGGCCGCCTGGCTCATCATCGCGATGGCGCTGGGCCCGCTG 

43960. + + + + + + 44019 

GCACCTTTCACCGACCAGGACCGGCGGACCGAGTAGTAGCGCTACCGCGACCCGGGCGAC 
27 RGKWLVLAAWLI IAMALGPL 

GCGGGGAAGCTCGCCGACGTCCAGGACTCCAGCGCCAACGCCTTCCTTCCGCGCAGCTCG 

44020 + + + + + + 44079 

CGCCCCTTCGAGCGGCTGCAGGTCCTGAGGTCGCGGTTGCGGAAGGAAGGCGCGTCGAGC 
27 AGKLADVQDSSANAFL PRSS 

GAGTCCGCGAAGCTGAACAAGGAACTGGAGAAGTTCCGCGCCGACGAGCTGATGCCGGCC 

44080 + + + + + + 44139 

CTCAGGCGCTTCGACTTGTTCCTTGACCTCTTCAAGGCGCGGCTGCTCGACTACGGCCGG 
27 ESAKLNKELEKFRADELMPA 

GTGGTGGTCTACAGCGCCGACGGCTCGCTGCCCGCCGAGGGGCGGGCCAAGGCCGAGAAG 

44140 + + + + + + 44199 

CACCACCAGATGTCGCGGCTGCCGAGCGACGGGCGGCTCCCCGCCCGGTTCCGGCTCTTC 
27 VVVYSADGSLPAEGRAKAEK 

GACATAGCCGCCTTCCAGGAGCTGGCCGCCGAGGGCGAGAAGGTCGAAGCGCCCCTGGAG 

44200 + + + + + + 44259 

CTGTATCGGCGGAAGGTCCTCGACCGGCGGCTCCCGCTCTTCCAGCTTCGCGGGGACCTC 
27 DIAAFQELAAEGEKVEAPLE 

TCGGAGGACGGCCAGGCGCTCATGGTCGTCGTTCCGCTGATCAGCGACGCCGACATCGTC 

44260 + + + + + + 44319 

AGCCTCCTGCCGGTCCGCGAGTACCAGCAGCAAGGCGACTAGTCGCTGCGGCTGTAGCAG 
27 SEDGQALMVVVPLI SDADIV 

GCCACGACGAAGAAGGTCCGCGATGTCGCGGACGCCAACGCCCCCCCGGGCGTCGCCATC 

44320 + + + + + + 44379 

CGGTGCTGCTTCTTCCAGGCGCTACAGCGCCTGCGGTTGCGGGGGGGCCCGCAGCGGTAG 
27 ATTKKVRDVADANAP PGVAI 

GAGGTGGGCGGGCCCGCCGGGTCGACGACCGACGCCGCCGGCGCTTTCGAGTCCCTCGAC 

44380 + + + + + + 44439 

CTCCACCCGCCCGGGCGGCCCAGCTGCTGGCTGCGGCGGCCGCGAAAGCTCAGGGAGCTG 
27 EVGGPAGSTTDAAGAFESLD 

TCCATGCTGATGATGGTCACCGGCCTTGTGGTCGCCATCCTGCTGCTGATCACCTACCGC 

44440 + + - + + + + 44499 

AGGTACGACTACTACCAGTGGCCGGAACACCAGCGGTAGGACGACGACTAGTGGATGGCG 
27 SMLMMV TGLVVAILLLITYR 

TCCCCCATCCTGTGGCTGCTGCCCCTGCTCTCCGTCGGCTTCGCCTCCGTGCTGACCCAG 

44500 + + + + + + 44559 

AGGGGGTAGGACACCGACGACGGGGACGAGAGGCAGCCGAAGCGGAGGCACGACTGGGTC 
27 S P I LWL L PLLS VG FAS VLTQ 

GTCGGCACCTACATGCTCGCCAAGTACGCCGGGCTGCCGGTCGACCCGCAGAGCTCCGGC 

44560 + + + + + + 44619 

CAGCCGTGGATGTACGAGCGGTTCATGCGGCCCGACGGCCAGCTGGGCGTCTCGAGGCCG 
27 VGTYMLAKYAGLPVDPQSSG 

GTCCTGATGGTCCTCGTGTTCGGTGTCGGCACCGACTACGCCCTGCTGCTCATCGCCCGC 

44620 + + + + + + 44679 

CAGGACTACCAGGAGCACAAGCCACAGCCGTGGCTGATGCGGGACGACGAGTAGCGGGCG 
27 VLMVLVFGVGTDYALLLIAR 

TACCGTGAGGAACTGCGCCGCGAGCAGGACCGGCACGTGGCCATGAAGACCGCGTTGCGA 

44680 + + + + + + 44739 

ATGGCACTCCTTGACGCGGCGCTCGTCCTGGCCGTGCACCGGTACTTCTGGCGCAACGCT 
27 YREELRREQDRHVAMKTALR 

CGGTCGGGCCCGGCCATCCTGGCCTCGGCCGGCACCATCGCCATCGGCCTCGTCTGCCTG 
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44740 + + + - + + + 44799 

GCCAGCCCGGGCCGGTAGGACCGGAGCCGGCCGTGGTAGCGGTAGCCGGAGCAGACGGAC 
27 RSGPAILASAGTIAIGLVCL 

GTCCTCGCGGACGTCAACTCCTCCCGCTCCATGGGCCTGGTCGGCGCGATCGGCGTGGTC 

44800 + + + + + + 44859 

CAGGAGCGCCTGCAGTTGAGGAGGGCGAGGTACCCGGACCAGCCGCGCTAGCCGCACCAG 
27 VLADVNSSRSMGLVGAI GVV 

TGCGCCCTCCTCGCCATGGTCACGATCCTGCCCGCGCTGCTGGTCATCCTGGGCCGCTGG 

44860 + + + + + + 44919 

ACGCGGGAGGAGCGGTACCAGTGCTAGGACGGGCGCGACGAC CAGTAGGACCCGGCGAC C 
27 CALLAMVTILPALLVILGRW 

GTGTTCTGGCCCTTCGTTCCCCGCTGGACGCCGGAGTCGGCCGCGGCCCCCGAGGCACCG 

44920 + + + + + + 44979 

CACAAGACCGGGAAGCAAGGGGCGACCTGCGGCCTCAGCCGGCGCCGGGGGCTCCGTGGC 
27 VPWPFVPRWTPESAAAPEAP 

GCGTCCCACAGCCGCTGGGAGCGCATCGGCTCCGTCACGGCCGCCCGGCCGCGCCGCGCC 

44980 + + + + + + 45039 

CGCAGGGTGTCGGCGACCCTCGCGTAGCCGAGGCAGTGCCGGCGGGCCGGCGCGGCGCGG 
27 ASHSRWERIGSVTAARPRRA 

TGGGTGCTGTCCTTGGCCGCGACGGGGCTTCTCGCCCTCAGTTCCCTCGGCCTCGACATG 

45040 + + + + + + 45099 

ACCCACGACAGGAACCGGCGCTGCCCCGAAGAGCGGGAGTCAAGGGAGCCGGAGCTGTAC 
27 WVLSLAATGLLALS S LGLDM 

GGACTCACCCAGAGCGAACTGCTCCAGACGAAGCCCGAGTCCGTCGTCGCCCAGGAGCGG 

45100 + + + + + + - 45159 

CCTGAGTGGGTCTCGCTTGACGAGGTCTGCTTCGGGCTCAGGCAGCAGCGGGTCCTCGCC 
27 GLTQSELLQTKPESVVAQER 

ATCTCCGCCCACTACCCGTCCGGCTCCTCCGACCCCGCCACCGTCGTCGCACCCAGCGCG 

45160 + + + + + + 45219 

TAGAGGCGGGTGATGGGCAGGCCGAGGAGGCTGGGGCGGTGGCAGCAGCGTGGGTCGCGC 
27 ISAHYPSGSSDPATVVAPSA 

GACGTGGCCGAGGTCCGCCGGGCCGCCGAGGGGACCGACGGAGTGGTCTCCGTCCAGGAC 

45220 + + + + + + 45279 

CTGCACCGGCTCCAGGCGGCCCGGCGGCTCCCCTGGCTGCCTCACCAGAGGCAGGTCCTG 
27 DVAEVRRAAEGTDGVVSVQD 

GGCCCCACCACTCCCGACGGAGAGCTGACCATGCTGTCCGTGGTGCTGAAGGACGTTCCC 

45280 + + + + + + 45339 

CCGGGGTGGTGAGGGCTGCCTCTCGACTGGTACGACAGGCACCACGACTTCCTGCAAGGG 
27 GPTTPDGELTMLSVVLKDVP 

GACAGCAGCGGGGCCAAGGACACCATCGATGCACTGCGGGACAACACGGATGCTCTCGTG 

45340 + + + + + + . 45399 

CTGTCGTCGCCCCGGTTCCTGTGGTAGCTACGTGACGCCCTGTTGTGCCTACGAGAGCAC 
27 DSSGAKDTIDALRDNTDALV 

GGGGGTACGACGGCCCAGAGCCTGGACACCCAGCGCGCCTCGGTCCGTGACCTCTGGGTC 

45400 + + + + + + 45459 

CCCCCATGCTGCCGGGTCTCGGACCTGTGGGTCGCGCGGAGCCAGGCACTGGAGACCCAG 
27 GGTTAQSLDTQRASVRDLWV 

ACCGTCCCCGCGGTCCTGCTGGTGGTCCTGCTCGTCCTGATCTGGCTGCTGCGCTCGGTC 

45460 + + + + + + 45519 

TGGCAGGGGCGCCAGGACGACCACCAGGACGAGCAGGACTAGACCGACGACGCGAGCCAG 
27 TVPAVLLVVLLVLIWLLRSV 

ACCGGACCGCTGATCATGCTCGGCACCGTGGTCGTGTCGTTCTTCGCGGCCCTGGGGGCG 
45520 + + + + + + 45579 
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TGGCCTGGCGACTAGTACGAGCCGTGGCACCAGCACAGCAAGAAGCGCCGGGACCCCCGC 
27 TGPL IMLGTVVVSFFAALGA 

TCCAACCTGCTCTTCGAGTACGTGATGGGGCACGCCGGCGTCGACTGGTCGGTGCCGCTT 

45580 + + + + + + 45639 

AGGTTGGACGAGAAGCTCATGCACTACCCCGTGCGGCCGCAGCTGACCAGCCACGGCGAA 
27 SNLLFEYVMGHAGVDWSVPL 

CTCGGGTTCGTGTACCTGGTCGCCCTCGGAATCGACTACAACATCTTCCTCATGCACCGG 

45640 + + + + + + 45699 

GAGCCCAAGCACATGGACCAGCGGGAGCCTTAGCTGATGTTGTAGAAGGAGTACGTGGCC 
27. LGFVYLVALGIDYNI FLMHR 

GTGAAGGAGGAGGTCGCTCTGCACGGCCATGCCAAGGGCGTGCTCACCGGCCTGACCACC 

45700 + + + + + + 45759 

CACTTCCTCCTCCAGCGAGACGTGCCGGTACGGTTCCCGCACGAGTGGCCGGACTGGTGG 
27 VKEEVALHGHAKGVLTGLTT 

ACCGGGGGCGTCATCACCAGTGCCGGCGTGGTCCTGGCCGCGACGTTCGCCGTCATCGCC 

45760 + + + + + + 45819 

TGGCCCCCGCAGTAGTGGTCACGGCCGCACCAGGACCGGCGCTGCAAGCGGCAGTAGCGG 
27 TGGVITSAGVVLAATFAVIA 

ACACTGCCGCTGGTCCCGATGGCCCAGATGGGTGTCGTGGTCGGCCTGGGCATTCTGCTG 

45820 + + + + + + 45879 

TGTGACGGCGACCAGGGCTACCGGGTCTACCCACAGCACCAGCCGGACCCGTAAGACGAC 
27 TLPLVPMAQMGVVVGLGILL 

GACACCTTCCTCGTCCGGACGATTCTTCTGCCGGCCCTGGCGCTCGATCTGGGGCCCCGG 

45880 + + + + + + 45939 

CTGTGGAAGGAGCAGGCCTGCTAAGAAGACGGCCGGGACCGCGAGCTAGACCCCGGGGCC 
27 DTFLVRT ILLPALALDLGPR 

TTCTGGTGGCCGGGCGCGCTGTCGAAGACGTCCGGGGGACCGGCCCCCGTCCGCGAGGAC 

45940 + + + + + + 45999 

AAGACCACCGGCCCGCGCGACAGCTTCTGCAGGCCCCCTGGCCGGGGGCAGGCGCTCCTG 
27 FWWPGALSKTSGGPAPV-RED 

CGCACGTCCCAGCCCGTGGGCTGAGACCCGTCCCGACGAGACCCGTACGGCGGGCGGCCG 

46000 + + + + + + 46059 

GCGTGCAGGGTCGGGCACCCGACTCTGGGCAGGGCTGCTCTGGGCATGCCGCCCGCCGGC 
27 RTSQPVG* (ORF27) 

GTTCCCCCGGGCCGTACGACTGAGCAACCCAGAAGATGGGCCGCCCGCGACCAGGCGTCA 

46060 + + + + + + 46119 

CAAGGGGGCCCGGCATGCTGACTCGTTGGGTCTTCTACCCGGCGGGCGCTGGTCCGCAGT 



CGATGGTGGCCCACCGGCCGCAGGCCGATCTCCCGGAAGGAAGCGCQGTGpTGGGCpATG 



46179 



46120 + + + + + -- 

GCTACCACCGGGTGGCCGGCGTCCGGCTAGAGGGCCTTCCTTCGCGC^CACAACCCGCTAC 
28 (ORF28)/ V L G D E - 



AGGACGGCAAGGCCGCCGAGCTGTGGTCGATGGCGAACCTGGGTACACCGATGGCCGTGC 

46180 + + + + + + 46239 

TCCTGCCGTTCCGGCGGCTCGACACCAGCTACCGCTTGGACCCATGTGGCTACCGGCACG 
28 DGKAAELWSMANLGTPMAVR- 

GCGTCGCGGCGACCCTGCGCATCGCCGACCACATCACGGCCGGAGCGCACACCGCCGGCG 

46240 + + + + + + 46299 

CGCAGCGCCGCTGGGACGCGTAGCGGCTGGTGTAGTGCCGGCCTCGCGTGTGGCGGCCGC 
28 VAATLR IADH I TAGAHTAGE- 

AAATCGCCGAAGCGGCCGCCGTGCACGAGGAATCCCTCGACCGGCTGCTGCGCTACCTCA 

46300 + + + + + + 46359 

TTTAGCGGCTTCGCCGGCGGCACGTGCTCCTTAGGGAGCTGGCCGACGACGCGATGGAGT 
28 IAEAAAVHEE S LDRLLRYLT- 
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CCGTCCGGGGCCTGCTGGACCGTGACGGGCTCGGCCGGTACACGCTGACCCCCCTGGGCC 

46360 + + + + + + 46419 

GGCAGGCCCCGGACGACCTGGCACTGCCCGAGCCGGCCATGTGCGACTGGGGGGACCCGG 
28 VRGLLDRDGLGRYTLTP LGR- 

GGCCGCTGTGCGAGGACCACCCCGCCGGCGTCCGGGCCTGGTTCGACATGGAGGGAGCGG 

46420 + + + + + + 46479 

CCGGCGACACGCTCCTGGTGGGGCGGCCGCAGGCCCGGACCAAGCTGTACCTCCCTCGCC 
28 PLCEDHPAGVRAWFDMEGAG- 

GGCGGGGCGAGCTGTCGTTCGTCGACCTGCTGCACAGCGTACGGACCGGGAAGGCCGCCT 

46480 + + + + + + 46539 

CCGCCCCGCTCGACAGCAAGCAGCTGGACGACGTGTCGCATGCCTGGCCCTTCCGGCGGA 
28 RGELS FVDLLHSVRTGKAAF- 

TCCCCCTGCGCTACGGCCGCCCCTTCTGGGAGGACCTGGCGGAGGACCCCCGCCGCGCGG 

46540 + + + + + + 46599 

AGGGGGACGCGATGCCGGCGGGGAAGACCCTCCTGGACCGCCTCCTGGGGGCGGCGCGCC 
28 PLRYGRP FWEDLAED P R R A E - 

AGTCCTTCAACCGGCTGCTCGGCCAGGACGTCGCCACTCGCGCCCCGGCCGTGGTGGCCG 

46600 + + + + + + 46659 

TCAGGAAGTTGGCCGACGAGCCGGTCCTGCAGCGGTGAGCGCGGGGCCGGCACCACCGGC 
28 S FNRLLGQDVATRA PAVVAG- 

GCTTCGACTGGGCGAGCACCGGTCATGTCATCGACCTCGGAGGCGGCGACGGCTCCCTGC 

46660 + + + + + + 46719 

CGAAGCTGACCCGCTCGTGGCCAGTACAGTAGCTGGAGCCTCCGCCGCTGCCGAGGGACG 
28 FDWASTGHVI DLGGGDGSLL- 

TGACCGCACTGCTGACCGCCTGTCCGTCACTGCGCGGCACGGTCCTGGACCTGCCCGAAG 

46720 + + + + + + 46779 

ACTGGCGTGACGACTGGCGGACAGGCAGTGACGCGCCGTGCCAGGACCTGGACGGGCTTC 
28 TALLTACPSLRGTVLDLPEA- 

CGGTGCAGCGTGCCAAGGAGTCGTTCGCCGTGTCCGGACTGGACGACCGGGCGAACGCGG 

46780 + + --' + + + + 46839 

GCCACGTCGCACGGTTCCTCAGCAAGCGGCACAGGCCTGACCTGCTGGCCCGCTTGCGCC 
28 VQRAKE S FAVSGLDDRANAV- 

TCGCGGGCAGCTTCTTCGACGCCCTCCCCGCCGGCGCGGGCGCCTACGTCCTGTCCCTGG 

46840 + + + + + + 46899 

AGCGCCCGTCGAAGAAGCTGCGGGAGGGGCGGCCGCGCCCGCGGATGCAGGACAGGGACC 
28 AGSFFDALPAGAGAYV LSLV- 

TCCTGCACGACTGGGACGACGAGGCGTCCGTCGCGATCCTGCGGCGCTGCGCCGAGGCGG 

46900 + + + + + + 46959 

AGGACGTGCTGACCCTGCTGCTCCGCAGGCAGCGCTAGGACGCCGCGACGCGGCTCCGCC 
28 LHDWDDEASVAI LRRCAEAA- 

CGGGGCAGACGGGATCGGTGTTCGTCATCGAGTCGACCGGCTCGGCGGGGGACGCCCCGC 

46960 + + + + + + 47019 

GCCCCGTCTGCCCTAGCCACAAGCAGTAGCTCAGCTGGCCGAGCCGCCCCCTGCGGGGCG 
28 GQTGSVFVI ESTGSAGDAPH- 

ACACAGGTATGGACCTGCGCATGCTGTGCATCTACGGAGCCAAGGAGCGCCGCGTGGAGG 

47020 + + + + + + 47079 

TGTGTCCATACCTGGACGCGTACGACACGTAGATGCCTCGGTTCCTCGCGGCGCACCTCC 
28 TGMDLRMLC IYGAKERRVEE- 

AGTTCGAGGAACTCGCCGGCCGGGCCGGGCTCCGGGTCGTCGCCGTCCACCCCGCGGGCC 

47080 + + + + + + - 47139 

TCAAGCTCCTTGAGCGGCCGGCCCGGCCCGAGGCCCAGCAGCGGCAGGTGGGGCGCCCGG 
28 FEELAGRAGLRVVAVH PAG P - 
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CTTCCGCGATCATCCAGATGTCCGCGGTCTGACCGCCCGGAGCCCCGGCCCATCGCGGCG 

47140 + + + + - + + 47199 

GAAGGCGCTAGTAGGTCTACAGGCGCCAGACTGGCGGGCCTCGGGGCCGGGTAGCGCCGC 

28 SAIIQMSAV* (ORF28) 

CGGGCCACGGCAGACAAGGAGAGAGCGTATGGCCGGCCTGGTCATGTCGCCGGTGGAGGC 

47200 + + + + + + 47259 

GCCCGGTGCCGTCTGTTCCTCTCTCGCATACCGGCCGGACCAGTACAGCGGCCACCTCCG 

(ORF29) MAGLVMS PVEA- 

GCTCGACGCGCTGGGCACGGTGCAGGGGCGTCAGGACCCCTATCCCTTCTACGAGGCGAT 

47260 + + + + + + 47319 

CGAGCTGCGCGACCCGTGCCACGTCCCCGCAGTCCTGGGGATAGGGAAGATGCTCCGCTA 
29 LDALGTVQGRQDPYPFYEAI 

CCGCGCGCACGGGCAGGCGGTCCCCACGAAGCCCGGCCGCTTCGTGGTGGTCGGCCACGA 

47320 + + + + + + 47379 

GGCGCGCGTGCCCGTCCGCCAGGGGTGCTTCGGGCCGGCGAAGCACCACCAGCCGGTGCT 

29 RAHGQAVPTKPGRFVVVGHD 

CGCGTGCGACCGGGCGCTGCGGGAACCGGCCCTGCGCGTCCAGGACGCCAGGAGCTACGA 

47380 + + + + + + 47439 

GCGCACGCTGGCCCGCGACGCCCTTGGCCGGGACGCGCAGGTCCTGCGGTCCTCGATGCT 
29 ACDRALREPALRVQDARSYD 

CGTCGTCTTCCCCTCGTGGCGGTCGCACTCCTCGGTCCGGGGGTTCACCAGCTCCATGCT 

47440 + + + + + + 47499 

GCAGCAGAAGGGGAGCACCGCCAGCGTGAGGAGCCAGGCCCCCAAGTGGTCGAGGTACGA 
29 VVFPSWRSHSSVRGFTSSML- 

CTACAGCAACCCGCCCGATCACGGCCGGTTGCGCCAGGTGGTGAGCTTCGCGTTCACCCC 

47500 + + + + + + 47559 

GATGTCGTTGGGCGGGCTAGTGCCGGCCAACGCGGTCCACCACTCGAAGCGCAAGTGGGG 
29 YSNPPDHGRLRQVVSFAFTP 

GCCCAAGGTGCGCCGGATGCACGGGGTGATCGAGGACATGACCGACCGGCTCCTCGACCG 

47560 + + + + + + 47619 

CGGGTTCCACGCGGCCTACGTGCCCCACTAGCTCCTGTACTGGCTGGCCGAGGAGCTGGC 
29 PKVRRMHGVIEDMTDRLLDR- 

GATGGCCCGGCTCGGCTCCGGCGGCTCCCCGGTCGACCTCATAGCCGAGTTCGCCGCCCG 

47620 + + + + + + 47679 

CTACCGGGCCGAGCCGAGGCCGCCGAGGGGCCAGCTGGAGTATCGGCTCAAGCGGCGGGC 
29 MARLGSGGS PVDL IAEFAAR 

GCTGCCCGTCGCGGTGATCAGCGAGATGATCGGCTTTCCGGCGAAGGACCAGGTGTGGTT 

47680 + . + + + + + 47739 

CGACGGGCAGCGCCACTAGTCGCTCTACTAGCCGAAAGGCCGCTTCCTGGTCCACACCAA 
29 LPVAVISEMIGFPAKDQVWF- 

CCGCGACATGGCCTCCCGGGTCGCCGTGGCGACGGACGGTTTCACCGACCCCGGCGCGCT 

47740 + + + + + + 47799 

GGCGCTGTACCGGAGGGCCCAGCGGCACCGCTGCCTGCCAAAGTGGCTGGGGCCGCGCGA 
29 RDMASRVAVATDGFTDPGAL 

CACGGGGGCCGACGCCGCCATGGACGAGATGAGCGCCTACTTCGACGACCTCCTGGACCG 

47800. + + + + + + 47859 

GTGCCCCCGGCTGCGGCGGTACCTGCTCTACTCGCGGATGAAGCTGCTGGAGGACCTGGC 
29 TGADAAMDEMSAYFDDLLDR- 

TCGCCGCCGCACCCCGGCCGACGACCTGGTCACCCTGCTCGCCGAGGCCCACGACGGCTC 

47860 + + + + + + 47919 

AGCGGCGGCGTGGGGCCGGCTGCTGGACCAGTGGGACGAGCGGCTCCGGGTGCTGCCGAG 
29 RRRTPADDLVTLLAEAHDGS- 

CCCCGGGCGCCTGGACCACGACGAACTGATGGGCACCATGATGGTGCTGCTCACAGCCGG 
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47920 + + + + + + 47979 

GGGGCCCGCGGACCTGGTGCTGCTTGACTACCCGTGGTACTACCACGACGAGTGTCGGCC 
29 PGRLDHDELMGTMMVLLTAG- 



GTTCGAGACCACGAGCTTTCTGATCGGCCACGGGGCGATGATCGCCCTCGAACAACGGGC 

47980 + + + + + + 48039 

CAAGCTCTGGTGCTCGAAAGACTAGCCGGTGCCCCGCTACTAGCGGGAGCTTGTTGCCCG 
29 FETTSFLIGHGAMIALEQRA- 

GCACGCGGCCCGGCTGCGGGCCGAACCCGACTTCGCCGACGGCTACGTCGAGGAGATCCT 

48040 + + + + + + 48099 

CGTGCGCCGGGCCGACGCCCGGCTTGGGCTGAAGCGGCTGCCGATGCAGCTCCTCTAGGA 
29 HAARLRAEPDFADGYVEEIL- 



CAGGTTCGAGCCGCCGGTCCACGTCACCAGCCGGTGGGCTGCCGAGGACCTCGACCTGCT 

48100 + + + + + + 48159 

GTCCAAGCTCGGCGGCCAGGTGCAGTGGTCGGCCACCCGACGGCTCCTGGAGCTGGACGA 
29 RFEPPVHVTSRWAAEDLDLL- 



GGGCCTGTCCGTACCGGCGGGCTCCAAGCTGGTCCTGATCCTGGCCGCCGCGAATCGCGA 

48160 + + + + + + 48219 

CCCGGACAGGCATGGCCGCCCGAGGTTCGACCAGGACTAGGACCGGCGGCGCTTAGCGCT 
29 GLSVPAGSKLVLI LAAANRD 



TCCCGGCCGCTACCCCGAGCCCGGCCGCTTCGACCCCGACCGCTACGCGCCCCGGCCGGG 

48220 + + + + + + 48279 

AGGGCCGGCGATGGGGCTCGGGCCGGCGAAGCTGGGGCTGGCGATGCGCGGGGCCGGCCC 
29 PGRYPE PGRFDPDRYAPRPG- 



CGGGCCGGAGGCCACCAGACCGCTGAGCTTCGGCGCGGGCGGCCACTTCTGCCTCGGCGC 

48280 + + + + + + 48339 

GCCCGGCCTCCGGTGGTCTGGCGACTCGAAGCCGCGCCCGCCGGTGAAGACGGAGCCGCG 
29 GPEATRPLSFGAGGHFCLGA- 



TCCGCTGGCGCGGCTGGAAGCCCGGATCGCGCTGCCGCGTCTGCTGCGCCGCTTCCCGGA 

48340 + + + + + + 48399 

AGGCGACCGCGCCGACCTTCGGGCCTAGCGCGACGGCGCAGACGACGCGGCGAAGGGCCT 
29 PLARLEARIALPRLLRRFPD- 



CCTGGCCGTGTCCGAGCCCCCCGTCTACCGCGACCGCTGGGTCGTCCGCGGCCTCGAAAC 

48400 + + + + + + 48459 

GGACCGGCACAGGCTCGGGGGGCAGATGGCGCTGGCGACCCAGCAGGCGCCGGAGCTTTG 
29 LAVS E P PVYRDRWVVRGLET 



CTTTCCCGTGACCCTCGGGTCCTGAGCCCCCGCCGGCCGGAACACGTGACCGTCCCGGCC 

48460 + + + + + + 48519 

GAAAGGGCACTGGGAGCCCAGGACTCGGGGGCGGCCGGCCTTGTGCACTGGCAGGGCCGG 
29 FPVTLGS* (ORF29) 

GGCGGGTGCGCGCCCTCTCAGACGTACAGGGTGTTGGGCCCCTGACCACACAGCACCCGG 

48520 + + + + + + 48579 

CCGCCCACGCGCGGGAGAGTCTGCATGTCCCACAACCCGGGGACTGGTGTGTCGTGGGCC 



CCGTACAGCTCCAGGTTGGTGCTCGGGTTCATGCAGGTGCAGCGTGATGCTCTGGGCATC 

48580 + + + + + + 48639 

GGCATGTCGAGGTCCAACCACGAGCCCAAGTACGTCCACGTCGCACTACGAGACCCGTAG 
30 (ORF30)* APAAHHE PC 

GCTGCACGCGCTGGATCGGGACGTCGTTGTAGATCGAGGACCCGCCGCTCGCCTGGGCGA 

48640 + + + + + + 48699 

CGACGTGCGCGACCTAGCCCTGCAGCAACATCTAGCTCCTGGGCGGCGAGCGGACCCGCT 
30 RQVRQI PVDNYI SSGGSAQA 

GGATGTCCACCGACTCCTTGCCCAGTCGGCACGCCCGCCCCAGCAGGCCGCGGCACAGCA 

48700 + + + + + + 48759 

CCTACAGGTGGCTGAGGAACGGGTCAGCCGTGCGGGCGGGGTCGTCCGGCGCCGTGTCGT 
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30 LIDVSEKGLRCARGLLGRCL 

CCCGCTCCTCCAGCGTCCAGGCCTCGCCCGAAGCCCCCTTGGAGTCGACGAGGTCGGCCA 

48760 + + + + + + 48819 

GGGCGAGGAGGTCGCAGGTCCGGAGCGGGCTTCGGGGGAACCTCAGCTGCTCCAGCCGGT 
30 VREELTWAEGSAGKSDVLDA 



GCCGATGGGCGTGGAACCGTGCCTCGTCGGCCAGCAGGGTCGCCTCGCCGAGCTGCAGGT 

48820 + + + + + + 48879 

CGGCTACCCGCACCTTGGCACGGAGCAGCCGGTCGTCCCAGCGGAGCGGCTCGACGTCCA 
30 LRHAHFRAEDALLTAEGLQL 

GGGTGATCGGCGCCGAGCCCTGCTCCTCGTACTCGGTGTAGGTGATCTTGCGGCCGGGCA 

48880 + + + + + + 48939 

CCCACTAGCCGCGGCTCGGGACGAGGAGCATGAGCCACATCCACTAGAACGCCGGCCCGT 
30 HTI PASGQEEYETYTI KRGP 

GCCTCCCGCGGAAGACGTCCTGAGCGGCCGCGGCCAGTCCGGTCATGGTGCCGACCGACG 

48940 + + + + + + 48999 

CGGAGGGCGCCTTCTGCAGGACTCGCCGGCGCCGGTCAGGCCAGTACCACGGCTGGCTGC 
30 LRGRFVDQAAAALGTMTGVS 

AGGCCGAGGCCACGGCCAGCATCGGCGCCCGGAACATCGGTGATCCGGCGTTGAGTTCGG 

49000 + + + + + + 49059 

TCCGGCTCCGGTGCCGGTCGTAGCCGCGGGCCTTGTAGCCACTAGGCCGCAACTCAAGCC 
30 SASAVALMPARFMPSGANLE 



AGGCGTACTGCTGCTGGAGCACCGCGCCCAGCGGAAGGACGCGCTCCTGGGGAACGAAGA 

49060 + + + + + + 49119 

TCCGCATGACGACGACCTCGTGGCGCGGGTCGCCTTCCTGCGCGAGGACCCCTTGCTTCT 
30 SAYQQQLVAG LPLVREQ PVF 

CGTCCGCGGCGATGGTGCTGACGCTTCCCGAGCCCCGGAGCCCCGAGGTGTGCCAGTCGT 

49120 + + + + + + 49179 

GCAGGCGCCGCTACCACGACTGCGAAGGGCTCGGGGCCTCGGGGCTCCACACGGTCAGCA 
30 VDAAITSVSGSGRLGSTHWD 



CGACGATCTGCAGCTGGTCGGTCGGCACCAGGGCCATCACGGGCTGCATGCCGCCGTCGG 

49180 + + + + + + 49239 

GCTGCTAGACGTCGACCAGCCAGCCGTGGTCCCGGTAGTGCCCGACGTACGGCGGCAGCC 
30 DVIQLQDTPVLAMVPQMGGD 

GGGTCGGTGAGACGGCGATCAGAACCTGCCAGTGACTGTGCCAGGCACCGCTGATGAAGC 

49240 + + + + + + 49299 

CCCAGCCACTCTGCCGCTAGTCTTGGACGGTCACTGACACGGTCCGTGGCGACTACTTCG 
30 PTPSVAILVQWHSHWAGSIF 

CCCACTTGCCGTTCACTACGACACCGCCGTCGACCGGGGCCGCCATGCCGCCGGGACTGA 

49300 + + + + + + 49359 

GGGTGAACGGCAAGTGATGCTGTGGCGGCAGCTGGCCCCGGCGGTACGGCGGCCCTGACT 
30 GWKGNVVVGGDVPAAMGGPS 



GGGTGCCGGAGACCCGGACATCCGGCCGGGAGAACACCTCGTCCTGCACGTGGTCGGGGA 

49360 + + + + + + 49419 

CCCACGGCCTCTGGGCCTGTAGGCCGGCCCTCTTGTGGAGCAGGACGTGCACCAGCCCCT 
30 LTGSVRVDPRS FVEDQVHDP 

AGAGGCCCGCCATCCAGGTGGGTATCCACCACACCGAGGCCGTCCAGGCGGCCGATCCGT 

49420 + + + + + + 49479 

TCTCCGGGCGGTAGGTCCACCCATAGGTGGTGTGGCTCCGGCAGGTCCGCCGGCTAGGCA 
30 FLGAMWTPI WWVSATWAASG 

CGCCGCGCGCCAGCTCGGCGGCCACGTCCACCAGGGTGCGGGCGTCGGACTCGAAGCCGC 

49480 + + + + + + 49539 

GCGGCGCGCGGTCGAGCCGCCGGTGCAGGTGGTCCCACGCCCGCAGCCTGAGCTTCGGCG 
30 DGRALE AAVDVLTRADS EFG 
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CGTAACGGGCCGGCACGCGCATGCGGAAGATCCCGGCTTCGGCCATCGCCTCGACCGACT 

49540 +- + + + + + 49599 

GCATTGCCCGGCCGTGCGCGTACGCCTTCTAGGGCCGAAGCCGGTAGCGGAGCTGGCTGA 
30 GYRAPVRMRFIGAEAMAEVS 

CCTCGTGCAGCCGCCGGTTCTCCTCGGTCCAGGCCGCGTGGGACTGGAGCAGCGGCCTCA 

49600 + + + + + + 49659 

GGAGCACGTCGGCGGCCAAGAGGAGCCAGGTCCGGCGCACCCTGACCTCGTCGCCGGAGT 
30 EEHLRRNEETWAAHSQLLPR 

GCTTCGAGGCCCGTTCCACCAGTTCGGTACGGGCGGGCGTAGACGTCTGGTCCACTCGAT 

49660 + + + + + + 49719 

CGAAGCTCCGGGCAAGGTGGTCAAGCCATGCCCGCCCGCATCTGCAGACCAGGTGAGCTA 

30 LKSAREVLETRAPTSTQDV 
(ORF30) 

CCTCCAGGAATCATGAGACGCCCTGTCCGCGGTATGCGGAAGCAGGCGTCTGCGCGCATC 

49720 + + + + + + 49779 

GGAGGTCCTTAGTACTCTGCGGGACAGGCGCCATACGCCTTCGTCCGCAGACGCGCGTAG 

GGTCAGGACGGCGTCGCCCTGCTCCCGCATGGTTCACCGAGTTCCGCGGACGTCGCATCT 

49780 + + + + 4- + 49839 

CCAGTCCTGCCGCAGCGGGACGAGGGCGTACCAAGTGGCTCAAGGCGCCTGCAGCGTAGA 

CCTTGATTGCCGGTCACCTACCCCGATGCCGATCGGGCTGGTGCGACAGCGCATCCCACG 

49840 + + + + + + 49899 

GGAACTAACGGCCAGTGGATGGGGCTACGGCTAGCCCGACCACGCTGTCGCGTAGGGTGC 

AGAAGTCCACGAACGGTCCGGGAAGCCAGAATGTGCTTCTCGGCCGGAGTCACGGCCGGC 

49900 + + + + + + 49959 

TCTTCAGGTGCTTGCCAGGCCCTTCGGTCTTACACGAAGAGCCGGCCTCAGTGCCGGCCG 

GCCGGCGCCCGTCGCCGGTCACGCCGGACCACGCCCGGACCGGTCATGGAGGCAGCCCAT 

49960 + + + + + + 50019 

CGGCCGCGGGCAGCGGCCAGTGCGGCCTGGTGCGGGCCTGGCCAGTACCTCCGTCGGGTA 

GAGTGACAACGACAGTCCGTCCCGGGTGCCGGCCGCGGTGGCACCCGCCACCGCGAAACC 

50020 + + + + + + 50079 

CTCACTGTTGCTGTCAGGCAGGGCCCACGGCCGGCGCCACCGTGGGCGGTGGCGCTTTGG 

GTCGGCCGGCACGGTCCTCGGCGCCGCGGTGGCTTCGCCCGCCGCCTACACCGCGGCGAC 

50080 + + + + + + 50139 

CAGCCGGCCGTGCCAGGAGCCGCGGCGCCACCGAAGCGGGCGGCGGATGTGGCGCCGCTG 

CGCCCAGGAAGCGGCGACCGCGCTGGTCCGCATGCTGATGGAACAGATGGTGCTCGGTCC 

50140 + + + + + + 50199 

GCGGGTCCTTCGCCGCTGGCGCGACCAGGCGTACGACTACCTTGTCTACCACGAGCCAGG 

CGGCGCGGTCGGTCCCGAGACCCGCGCGGACGGCCCGGCGCGGCGGACCGGCTCCGGCCA 

50200 + + + + + + 50259 

GCCGCGCCAGCCAGGGCTCTGGGCGCGCCTGCCGGGCCGCGCCGCCTGGCCGAGGCCGGT 

CGGCCCGGCGCCGCAGACCGGACCGGACGCGCCGGGCGAACCCGCGCCCACGTGGGCGCC 

50260 + + + + + + 50319 

GCCGGGCCGCGGCGTCTGGCCTGGCCTGCGCGGCCCGCTTGGGGGCGGGTGCACCCGCGG 

GAACCTCGACGACGGGAAGGTAGGAGGACGATGAGGCCGCTCGTTCGGGCAGTGCTGCGG 

50320 + + + + + + 50379 

CTTGGAGCTGCTGCCCTTCCATCCTCCTGCTACTCCGGCGAGCAAGCCCGTCACGACGCC 
31 (ORF31) MRPLVRAVLR 

GGTTCCCTGCGGCAGGTGAGGTACGTGGACGTGGTCTCCCCGCGCCGGGCGCGCTCCCTG 

50380 + + + + + + 50439 

CCAAGGGACGCCGTCCACTCCATGCACCTGCACCAGAGGGGCGCGGCCCGCGCGAGGGAC 

31 GS LRQVRYVDVVS PRRARSL 
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GTGGCGCGGGTGTACCGGGAGACCGAGGAGCAGTTCGGCGTGCTCGCGCCCCCCCTGGCC 

50440 + + + + + + 50499 

CACCGCGCCCACATGGCCCTCTGGCTCCTCGTCAAGCCGCACGAGCGCGGGGGGGACCGG 
31 VARVYRETEEQFGVLAP PLA 

CTCCACTCGCCCGCCGCGGCGTCGCTGGCCGCGACGTGGCTCATGCTGCGGGAGACACTG 

50500 + + + + + + 50559 

GAGGTGAGCGGGCGGCGCCGCAGCGACCGGCGCTGCACCGAGTACGACGCCCTCTGTGAC 
31 LHSPAAASLAATWLMLRETL 

CTGGTCGACGGGCGGGTGAGCCGGGCGGTGAAGGAGACGGTCGCCACCGAGGTCTCCCGT 

50560 + + + + + + 50619 

GACCAGCTGCCCGCCCACTCGGCCCGCCACTTCCTCTGCCAGCGGTGGCTCCAGAGGGCA 
31 LVDGRVSRAVKETVATEVSR 

GCCAACGACTGTCCGTACTGCGTCCAGGTCCATCAGGCGGTACTCGGGACACTGCCTCCG 

50620 + + + + + + 50679 

CGGTTGCTGACAGGCATGACGCAGGTCCAGGTAGTCCGCCATGAGCCCTGTGACGGAGGC 
31 ANDCPYCVQVHQAV LGTLPP 

GACGGCGGCCAGGCCGGGCTCCTGCGGTGGGTCCGGGAGGCAGGCCGACGGCCCGGCGGC 

50680 + + + + + + 50739 

CTGCCGCCGGTCCGGCCCGAGGACGCCACCCAGGCCCTCCGTCCGGCTGCCGGGCCGCCG 
31 DGGQAGLLRWVREAGRRPGG 

GGTGCGGTGGGCGGCGGGCGGCCGCTTCCGTTCAGCGGTGAACAGGCACCGGAACTGTGC 

50740 + + + + + + 50799 

CCACGCCACCCGCCGCCCGCCGGCGAAGGCAAGTCGCCACTTGTCCGTGGCCTTGACACG 
31 GAVGGGRPLPFSGEQAP ELC 

GGCGTCGTGGTCACGTTCCACTACATCAACCGCATGGTCTCCCTCTTCCTCGACGACTCC 

50800 + + + + + + 50859 

CCGCAGCACCAGTGCAAGGTGATGTAGTTGGCGTACCAGAGGGAGAAGGAGCTGCTGAGG 
31 GVVVTFHYINRMVSLFLDDS 

CCCATGCCGACCCGGACGCCGACACCGTTGCGCGGGCCCATCATGAGGACCACCGCACTG 

50860 + + + + + + 50919 

GGGTACGGCTGGGCCTGCGGCTGTGGCAACGCGCCCGGGTAGTACTCCTGGTGGCGTGAC 
31 PMPTRT PTPLRGPIMRTTAL 

GCCATGCGTCCCGTCGGCCCGGGGCTGCTGACACCGGGCGCATCGCTCGGCCTGCTGCCT 

50920 + + + + + + 50979 

CGGTACGCAGGGCAGCCGGGCCCCGACGACTGTGGCCCGCGTAGCGAGCCGGACGACGGA 
31 AMRPVGPGLLTPGASLGLLP 

CCGGCTCCCCTGCCGCCCGGACTGGAGTGGGCCGAGGGCAACCCTTTCGTGGCCCAGGCG 

50980 + + + + + + 51039 

GGCCGAGGGGACGGCGGGCCTGACCTCACCCGGCTCCCGTTGGGAAAGCACCGGGTCCGC 
31 PAPLPPGLEWAEGNPFVAQA 

CTGGGGCGTGCCGTCGCCGCTGTGGACCAGGGAGCGCACTGGGTGCCCGAACCGGTCCGG 

51040 + + + + + + 51099 

GACCCCGCACGGCAGCGGCGACACCTGGTCCCTCGCGTGACCCACGGGCTTGGCCAGGCC 
31 LGRAVAAVDQGAHWVPE PVR 

GAGCGGCTGCGCACACGTCTGGACACCTGGGACGGATCGGCGCCGGGCCTCGGCCGGGGA 

51100 + + + + + + 51159 

CTCGCCGACGCGTGTGCAGACCTGTGGACCCTGCCTAGCCGCGGCCCGGAGCCGGCCCCT 
31 ERLRTRLDTWDGSAPGLGRG 

TGGCTCGACGAGGCCGTGTCCGGCCTGCCGCCCCAGGACGTGCCCGCGGCACGGCTGGCG 

51160 + + + + + + 51219 

ACCGAGCTGCTCCGGCACAGGCCGGACGGCGGGGTCCTGCACGGGCGCCGTGCCGACCGC 
31 WLDEAVSGL PPQDVPAARLA 

CTGCTGACGGCCTTCGCCCCCTACCAGGTGCTCCCGGACGACGTCGAGGAGTTCAGACGG 
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51220 + + + + + + 51279 

GACGACTGCCGGAAGCGGGGGATGGTCCACGAGGGCCTGCTGCAGCTCCTCAAGTCTGCC 
31 LLTAFAPYQVLPDDVEEFRR 

CGTCGGCCCACCGACCGCGAACTCGTCGAGCTCACGTCCTACGCCGCGCTGACCACGGCC 

51280 + + + + + + 51339 

GCAGCCGGGTGGCTGGCGCTTGAGCAGCTCGAGTGCAGGATGCGGCGCGACTGGTGCCGG 
31 RRPTDRELVELTSYAALTTA 

GTCCGTGTCGGTCGCACGCTCGTCGTGCCCGACGCCGCCGGGCCGGGATGAACGGCCCCG 

51340 + + + + + + 51399 

CAGGCACAGCCAGCGTGCGAGCAGCACGGGCTGCGGCGGCCCGGCCCTACTTGCCGGGGC 

31 VRVGRTLVVPDAAGPG* (ORF31) 

CAACGGCTCGGGAAGGCTGTCTCACGGCCGGAGGCGTACGCCGGTGAGGTGCTCGGACTC 

51400 + + + + + + 51459 

GTTGCCGAGCCCTTCCGACAGAGTGCCGGCCTCCGCATGCGGCCACTCCACGAGCCTGAG 

(ORF32) * PRLRVGTLHESE- 

CTCCCAGAGGCGGCGCCGGGCCCTGGGGTCGACGGCTGCTCCGCCGGGGCGCACGAGCCC 

51460 + + + + + + 51519 

GAGGGTCTCCGCCGCGGCCCGGGACCCCAGCTGCCGACGAGGCGGCCCCGCGTGCTCGGG 

32 EWLRRRARPDVAAGG P RVLG- 

GGGTGCGCCCCGGGTCTCGGTCACGCCGAGGGGCCCGTAGAACTCGCCCCCGCGCGCGCC 

51520 + + + + + + 51579 

CCCACGCGGGGCCCAGAGCCAGTGCGGCTCCCCGGGCATCTTGAGCGGGGGCGCGCGCGG 
32 PAGRTETVGL PGYFEGGRAG- 

GGGATCGGTGGCCGCCCGCAGACCAGGCAGCATCCCCGCCGCGGCGGGCTGCAGGAACAA 

51580 + + + + + + 51639 

CCCTAGCCACCGGCGGGCGTCTGGTCCGTCGTAGGGGCGGCGCCGCCCGACGTCCTTGTT 
32 PDTAARLGPLMGAAAPQLFL- 

CGGGGCGAGCGGGGAGCCGAGCCTGCGCACGGGCGCGGGAAAGTCCCGGCCCAGACCGGT 

51640 + + + + + + 51699 

GCCCCGCTCGCCCCTCGGCTCGGACGCGTGCCCGCGCCCTTTCAGGGCCGGGTCTGGCCA 
32 PALPSGLRRVPAPFDRGLGT- 

CGCGGTCAGCCCGGGATGAGCGGCGAGCGAGGCCAGTTCCGCGCCGGACTCCGCCAGTCT 

51700 + + + + + + 51759 

GCGCCAGTCGGGCCCTACTCGCCGCTCGCTCCGGTCAAGGCGCGGCCTGAGGCGGTCAGA 
32 ATLGP HAALSALiEAGS EALR- 

GTGATGGAGTTCCAGCGCGAACATGAGGTTGGCCAGCTTGGACTGGTTGTAGGCCCGGTA 

51760 4- + + + + + 51819 

CACTACCTCAAGGTCGCGCTTGTACTCCAACCGGTCGAACCTGACCAACATCCGGGCCAT 
32 HHLELAFMLNALKS QNYARY- 

CCGGCTGTAGCGGCGTTCGCCGTGAAGGTCGCTGAAGTCGATGCGCCCCAGCCGGTGCAG 

51820 + + + + + + 51879 

GGCCGACATCGCCGCAAGCGGCACTTCCAGCGACTTCAGCTACGCGGGGTCGGCCACGTC 
32 RSYRREGHLDSFDIRGLRHL- 

ATAGCTGCTGATCGTCACGACCCGCGCGCCCGGCGCGGCCCGCAGGCTGTCCAGGAGCAG 

51880 + + + + + + 51939 

TATCGACGACTAGCAGTGCTGGGCGCGCGGGCCGCGCCGGGCGTCCGACAGGTCCTCGTC 
32 YSS ITVVRAGPAARLSDLLL- 

GCCGGTGAGGGCGAAGTGCCCCAGGTGGTTCGTGGCGAACTGGAGTTCGTGACCGTCCGG 

51940 + + + + + + 51999 

CGGCCACTCCCGCTTCACGGGGTCCACCAAGCACCGCTTGACCTCAAGCACTGGCAGGCC 
32 GTLAFHGLHNTAFQLEHGDP- 

GGTGCGGGCCCGGTCGGTCCACATCACGCCCGCGTTGTTGACCAGCAGGTGGATGCGCGG 
52000 + + + + + + 52059 
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CCACGCCCGGGCCAGCCAGGTGTAGTGCGGGCGCAACAACTGGTCGTCCACCTACGCGCC 
32 TRARDTWMVGANNVLLH I R P - 

GAAGCGGTCGCGCAGTTCCTCGGCGCCGGCACGCACCGACGCGAGACGGGAAAGATCCAG 

52060 + + + + + + 52119 

CTTCGCCAGCGCGTCAAGGAGCCGCGGCCGTGCGTGGCTGCGCTCTGCCCTTTCTAGGTC 
32 FRDRLEEAGARVSALRSLDL- 

CCGTCTGACCGTCAGTTGCGCCGACGGCACCCGGCTTTGGATGCGGGCCGCCGCGGCGAC 

52120 + + + + + + 52179 

GGCAGACTGGCAGTCAACGCGGCTGCCGTGGGCCGAAACCTACGCCCGGCGGCGCCGCTG 
32 RRVTLQAS PVRS Q I R A A A A V - 

CCCGCGGTCCGGATCGCGCACGGCCAGCACCACGTGGGCGCCGTGCCGGGCGAGCTCCTG 

52180 + + + + + + 52239 

GGGCGCCAGGCCTAGCGCGTGCCGGTCGTGGTGCACCCGCGGCACGGCCCGCTCGAGGAC 
32 GRD PDRVALVVHAGHRAL E Q - 

CGCCAGGTGCAGTCCGATGCCGGAGCTGGCACCGGTGACCACCGCGGTGGTTCCGGTACG 

52240 + + + + + + 52299 

GCGGTCCACGTCAGGCTACGGCCTCGACCGTGGCCACTGGTGGCGCCACCAAGGCCATGC 
32 ALHLG I GS SAGTVVATTGTR- 

GTCCGGGACATCGGCGGCGCTCCAGCGTCGCCGCGTTCTCATCGGTCGTCCCTCCCGGGG 

52300 + + + + + + 52359 

CAGGCCCTGTAGCCGCCGCGAGGTCGCAGCGGCGCAAGAGTAGCCAGCAGGGAGGGCCCC 
32 DPVDAASWRRRTRM (ORF32) 

GATGCGTCAGCCGGCCTGGGCCATCGCGGCCCGGTAGCCGTTGGCGACGATCTGCCGGGC 

52360 + + + + + + 52419 

CTACGCAGTCGGCCGGACCCGGTAGCGCCGGGCCATCGGCAACCGCTGCTAGACGGCCCG 

GGAGTGCTCGTAGTACTCGTCGTCCTTCGGCAGCTCCGTGGCGAGACCGCTGACGTACCG 

52420 + + + + + + 52479 

CCTCACGAGCATCATGAGCAGCAGGAAGCCGTCGAGGCACCGCTCTGGCGACTGCATGGC 

GTTGAACATGCAGAACGCGGCGGCGATCAGAACGGTGTCGTGCAGAGCGGTGTCGTCCGC 

52480 + + + + + + 52539 

CAACTTGTACGTCTTGCGCCGCCGCTAGTCTTGCCACAGCACGTCTCGCCACAGCAGGCG 

TCCCTCGGCCCGCGCCGAGGCGATCACCCCTGCGGAGACCGGGCGCGCCGCGCTCTGGAC 

52540 + + + + + + 52599 

AGGGAGCCGGGCGCGGCTCCGCTAGTGGGGACGCCTCTGGCCCGCGCGGCGCGAGACCTG 

CTCGGCGGCGACGGCCAGCAGCGCGCGCGTCCTGCCGTCGATGGGCGCGGTGGCGGGGTC 

52600 + + + + + + 52659 

GAGCCGCCGCTGCCGGTCGTCGCGCGCGCAGGACGGCAGCTACCCGCGCCACCGCCCCAG 

GGCGAGGACGGCCTCGACGAGCTGCCGGCCTCCCGGCAGCTGCGCGGCGGCGAAGGCCCC 

52660 + + + + + + 52719 

CCGCTCCTGCCGGAGCTGCTCGACGGCCGGAGGGCCGTCGACGCGCCGCCGCTTCCGGGG 

GTGGGAGGCGGCGCAGAACTCGGTGGAGTTGAGATGCGAGACGTACGCCGCGATGAGCTC 

52720 + + + + + + 52779 

CACCCTCCGCCGCGTCTTGAGCCACCTCAACTCTACGCTCTGCATGCGGCGCTACTCGAG 

GCGTTGCCCCGGTTCCAGCGAGGACGGCGCCCGCAGCAGGGCGTTCGCGAGATCGCCCAG 

52780 + + + + + + 52839 

CGCAACGGGGCCAAGGTCGCTCCTGCCGCGGGCGTCGTCCCGCAAGCGCTCTAGCGGGTC 

CGGTGCTGCGGTGCCGGGGTGGTGAGCCATCAGACCACTGATGCCGGGGAGGTCGTTGTC 

52840 + + + + + + 52899 

GCCACGACGCCACGGCCCCACCACTCGGTAGTCTGGTGACTACGGCCCCTCCAGCAACAG 

GAGTGCTATGTGGGGCACGGCTCTTCCTTCCGGGTGGACGAGGGGCGGACGGCGGCGGAT 
52900 + + + + + + 52959 
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CTCACGATACACCCCGTGCCGAGAAGGAAGGCCCACCTGCTCCCCGCCTGCCGCCGCCTA 

CAGGGCCATTCGACTTCGTCGTCGGCGGCCGCGCAGATGCGGGTGAAGGGCCATTCCACG 

+ + + + + + 

GTCCCGGTAAGCTGAAGCAGCAGCCGCCGGCGCGTCTACGCCCACTTCCCGGTAAGGTGC 



TCTTCCCCTCCCGTTGCGGAGTGGGCGGAGGCCGTGGTGAAGAGGGTGACGAGTCCGAAC 

53020 + + + + + + 53079 

AGAAGGGGAGGGCAACGCCTCACCCGCCTCCGGCACCACTTCTCCCACTGCTCAGGCTTG 



GTGCCGAAGAGGAGGGACAGTCGGGCAACGTGAAGTGCGGTACCCATGCGAGCTCCTAGC 

53080 + + + + + + 53139 

CACGGCTTCTCCTCCCTGTCAGCCCGTTGCACTTCACGCCATGGGTACGCTCGAGGATCG 

GAGGGCGGCGTGACCGCGGGACGGTGAGACCTCGTGATGCCAGGAAGCTAGCGAATCGGA 

53140 + + + + + + 53199 

CTCCCGCCGCACTGGCGCCCTGCCACTCTGGAGCACTACGGTCCTTCGATCGCTTAGCCT 

CTGAGGGTGGCAACGATATGCCAGACTTTGGCAACTTGCCTGTGTATCAGCCGGACTGTC 

53200 + + + + + + 53259 

GACTCCCACCGTTGCTATACGGTCTGAAACCGTTGAACGGACACATAGTCGGCCTGACAG 

(ORF33) V Y Q P D C R 

GGCCGCTGGTAAAGACGGAACGGCGAGATCCCGCGACCGCGTCGCAGAGCAGCAGGGTCT 

53260 + + + + + + 53319 

CCGGCGACCATTTCTGCCTTGCCGCTCTAGGGCGCTGGCGCAGCGTCTCGTCGTCCCAGA 

PLVKTERRDPATASQS SRVC- 

GCTCACCCAGCGTCGGGGCGGCCAGCATGTCGCGTACCGGGAGCGTGACGCCCAGCTCGC 

53320 + + + + + + 53379 

CGAGTGGGTCGCAGCCCCGCCGGTCGTACAGCGCATGGCCCTCGCACTGCGGGTCGAGCG 

SPSVGAASMSRTGSVTPSSR- 

GGTTGATCCTGCGGACCAGCCGGGTGATGAGCAGGGAGTCGCCGCCGTGGGCGAAGAAAT 

53380 + + + + + + 53439 

CCAACTAGGACGCCTGGTCGGCCCACTACTCGTCCCTCAGCGGCGGCACCCGCTTCTTTA 

LILRTSRVMSRES PPWAKKS- 

CAGCACCTTCGGAGGGGTCCGGGAAGCCGAGCAGGTCACCCCAGCCGCGCACCAGTACCT 

5 3 44 0 + + + + + + 53499 

GTCGTGGAAGCCTCCCCAGGCCCTTCGGCTCGTCCAGTGGGGTCGGCGCGTGGTCATGGA 

APSEGSGKPSRSPQPRTSTW- 

GGCGGATGTCGCCGGTGGTGACGACCGTGCGCCGGGAGCCCCGACGTGCCGAGCGCAGCC 

53500 + + + + + + 53559 

CCGCCTACAGCGGCCACCACTGCTGGCACGCGGCCCTCGGGGCTGCACGGCTCGCGTCGG 

RMS PVVTTVRREPRRAERSR- 

GCGAGGCATGCACCAGCGCCACCTGGTCGCCGAGGTTGCGCCGCGACAGCTCGCGCAGCG 

53560 + + + + + + 53619 

CGCTCCGTACGTGGTCGCGGTGGACCAGCGGCTCCAACGCGGCGCTGTCGAGCGCGTCGC 

EACTSATWSPRLRRDSSRSD- 

ACACCGTGACGCCGAACCTCTCGGTGATCCTGCGGACCAGCCGCGTGATCAGCAGCGTGT 

53620 + + + + + + 53679 

TGTGGCACTGCGGCTTGGAGAGCCACTAGGACGCCTGGTCGGCGCACTAGTCGTCGCACA 

TVTPNLSVILRTSRVI SSVS- 

CCCCGCCGCGCGCGAAGAAATCCGAATGCTCGGTGAGGTCGGAGCGGCCGAGGAGCTCGC 

53680 + + + + + + 53739 

GGGGCGGCGCGCGCTTCTTTAGGCTTACGAGCCACTCCAGCCTCGCCGGCTCCTCGAGCG 

PPRAKKSECSVRSERPRSSL- 

TCCACGCGCCGACCATGAACTCCCCCACGTCACCGAGCCGGTGCTCGTCGCCGTCGGGGC 

53740 + + + + + + 53799 

AGGTGCGCGGCTGGTACTTGAGGGGGTGCAGTGGCTCGGCCACGAGCAGCGGCAGCCCCG 
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33 HAPTMNSPTSPSRCSSPSGP- 

CCTTCGGCGCGCCGGATCCCGCGGAACGGTTCCGGCCGGAGACGGCAGAGCGGTCACTGG 

53800 + + + + + + 53859 

GGAAGCCGCGCGGCCTAGGGCGCCTTGCCAAGGCCGGCCTCTGCCGTCTCGCCAGTGACC 
33 FGAPDPAERFRPETAERSLV- 

TCACTTTCGCCACCTCCAGGGGCATGTGTCGGCTGCATCGGCTTCCCGCCACGGTACGGG 

53860 + + + + + + 53919 

AGTGAAAGCGGTGGAGGTCCCCGTACACAGCCGACGTAGCCGAAGGGCGGTGCCATGCCC 
33 TFATSRGMCRLHRL PATVRE- 

AGCACATGTTGCATGGCAATACCTTTCC7UVGTCGGTGGCAACCCTCCTTGCCATCCACCC 

53920 + + + + + + 53979 

TCGTGTACAACGTACCGTTATGGAAAGGTTCAGCCACCGTTGGGAGGAACGGTAGGTGGG 
33 HMLHGNTF PSRWQP SLPSTH- 

ACTGCAGTTGGGCGAGATGTGTAGGCATTCGAGGTCCGCAGGTTTGCCAAGCCGCGCGCG 

53980 + + + + + + 54039 

TGACGTCAACCCGCTCTACACATCCGTAAGCTCCAGGCGTCCAAACGGTTCGGCGCGCGC 
33 CSWARCVGI RGPQVCQAARD- 

ACCGGCATACTCTCTGGCACAACTGGAATGAGTAGCGTGGCAGGCCACGGGGACCGGGCC 

54040 + + + + + + 54099 

TGGCCGTATGAGAGACCGTGTTGACCTTACTCATCGCACCGTCCGGTGCCCCTGGCCCGG 

33 RHTLWHNWNE* (ORF33) 

GGGCCAGGAACCTTCGTCCTCCATCTATTCGCTGGGGCGTGCACGTGTTGGAGCAGCCAT 

54100 + + + + + + 54159 

CCCGGTCCTTGGAAGCAGGAGGTAGATAAGCGACCCCGCACGTGCACAACCTCGTCGGTA 

CTTTCGGCCGTCGCCTGAGGCAGCTGAGGACCGAGCGGGGTCTTTCCCAGGCCGCGCTCG 

54160 + + + + + + 54219 

GAAAGCCGGCAGCGGACTCCGTCGACTCCTGGCTCGCCCCAGAAAGGGTCCGGCGCGAGC 

CGGGGGACGGCATGTCTACGGGCTATCTCTCGCGCCTGGAGTCGGGCGCCCGGCAGCCCT 

54220 + + + + + + 54279 

GCCCCCTGCCGTACAGATGCCCGATAGAGAGCGCGGACCTCAGCCCGCGGGCCGTCGGGA 

34 (ORF34) MSTGYLSRLESGARQPS- 

CCGATCGCGCCGTCGCCCACCTGGCCGGACAACTCGGCATCAGCCCGTCGGAGTTCGAAG 

54280 + + + + + + 54339 

GGCTAGCGCGGCAGCGGGTGGACCGGCCTGTTGAGCCGTAGTCGGGCAGCCTCAAGCTTC 
34 DRAVAHLAGQLGI S P SE FEG- 

GGTCCCGGGCCACCTCGCTCGCCCAGATCCTCTCCCTCTCCACTTCCCTGGAGTCCGACG 

54340 + + + + + + 54399 

C CAGGGC CCGGTGGAGCGAGCGGGTCTAGGAGAGGGAGAGGTGAAGGGACCTCAGGCTGC 
34 SRATSLAQ-ILSLSTSLESDE- 

AGACCAGTGAGCTTCTCGCCGAGGCGGTACGTTCCGCGCATGGCCAGGATCCGATGCTCC 

54400 + + + + + + 54459 

TCTGGTCACTCGAAGAGCGGCTCCGCCATGCAAGGCGCGTACCGGTCCTAGGCTACGAGG 
34 TSELLAEAVRSAHGQDPMLR- 

GCTGGCAGGCCCTGTGGCTGCTGGGACAGTGGAAGCGCCGGCACGGCGACTCGGCCGGCG 

54460 + + + + + + 54519 

CGACCGTCCGGGACACCGACGACCCTGTCACCTTCGCGGCCGTGCCGCTGAGCCGGCCGC 
34 WQALWLLGQWKRRHGD SAGE- 

AGCACGGCTACCTCCAGCGTCTGGTGACGCTGAGTGAGGAGATCGGCCTGGCCGAGTTGC 

54520 + + + + + + 54579 

TCGTGCCGATGGAGGTCGCAGACCACTGCGACTCACTCCTCTAGCCGGACCGGCTCAACG 
34 HGYLQRLVTLS EE I GLAELR- 

GCGCACGGGCCCTGACCCAGTTCGCCCGGTCGCTGCGGGTACTGGGCGAGATCGTTCCGG 
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54580 + + + + + + 54639 

CGCGTGCCCGGGACTGGGTCAAGCGGGCCAGCGACGCCCATGACCCGCTCTAGCAAGGCC 
34 ARALTQFARSLRVLGEIV PA- 

CGGTGGAGGCTGCCGCCGCCGCCCACCGGCTCGCGGTGGACCATGCGCTGTCCAGCCAGG 

54640 + + + + + + 54699 

GCCACCTCCGACGGCGGCGGCGGGTGGCCGAGCGCCACCTGGTACGCGACAGGTCGGTCC 
34 VEAAAAAHRLAVDHALSSQD- 

ACAGGGCCGCTTCGCTGCTGGTTCTGGTGTCGGTGGAGGCCGAGGCGGGACGGATGCCCG 

54700 + - + + + + + 54759 

TGTCCCGGCGAAGCGACGACCAAGACCACAGCCACCTCCGGCTCCGCCCTGCCTACGGGC 
34 RAASLLVLVSVEAEAGRMPD- 

ACGCCCGGCGCCACGCCGACGAACTGACCGTCCTGGTGAGGGGACGGTCCGACACTCTGT 

54760 + + + + + + 54819 

TGCGGGCCGCGGTGCGGCTGCTTGACTGGCAGGACCACTCCCCTGCCAGGCTGTGAGACA 
34 ARRHADELTVLVRGRS DTLW- 

GGGCCGAGGCGTTGTGGACGGCGGGTGCGTTGAAGGTGCGGCAGGGCGAGTTCGCCGCGG 

54820 + + + + + + 54879 

CCCGGCTCCGCAACACCTGCCGCCCACGCAACTTCCACGCCGTCCCGCTCAAGCGGCGCC 
34 AEALWTAGALKVRQGEFAAA- 

CCGAGGTCCTTTTCCAGGAGGCTCTGGACGGGTTCGACAGCCGGGAGAACCTGACGATCT 

54880 + + + + + + 54939 

GGCTCCAGGAAAAGGTCCTCCGAGACCTGCCCAAGCTGTCGGCCCTCTTGGACTGCTAGA 
34 EVLFQEALDGFDS RENLT IW- 

GGCTGCGGCTGCGCATCGCGATGGCCGAACTCCACCTGCAGAAACTTCCTCCCGAGCCCG 

54940 + + + + + + 54999 

CCGACGCCGACGCGTAGCGCTACCGGCTTGAGGTGGACGTCTTTGAAGGAGGGCTCGGGC 
34 LRLRIAMAELHLQKLPPEPD- 

ACGCCGCGCAGCTCTGCATCGAGGCGGCGGAGGCGGCCCTTCCCTTTGCCCGCACATCCG 

55000 + + + - + + + 55059 

TGCGGCGCGTCGAGACGTAGCTCCGCCGCCTCCGCCGGGAAGGGAAACGGGCGTGTAGGC 
34 AAQLC I EAAEAALP FARTSA- 

CTCTGGAACAGTCCCTCGCCGCTCTGCGGGCGCGCCTCGCCTTCCATGAGGGCAGGTTCG 

55060 + + + + + + 55119 

GAGACCTTGTCAGGGAGCGGCGAGACGCCCGCGCGGAGCGGAAGGTACTCCCGTCCAAGC 
34 LE QS LAALRARLAFH EGRFA- 

CCGATGCCCGCGCGTTGTTGGAGAGGCTCGGCAGGACCGAGCTCCGGCTGCCCTATCAGA 

55120 + + + + + + 55179 

GGCTACGGGCGCGCAACAACCTCTCCGAGCCGTCCTGGCTCGAGGCCGACGGGATAGTCT 
34 DARALLERLGRTELRL PYQS- 

GCCGGATCCGCCTGGAGGTCCTCGGTCATCAGCTGCGCATCCTGAGCGGGGAGGAGGAGG 

55180 + + + + + + 55239 

CGGCCTAGGCGGACCTCCAGGAGCCAGTAGTCGACGCGTAGGACTCGCCCCTCCTCCTCC 
34 RI RLEVLGHQLRI LSGEEEE- 

AAGGCCTGGCCGGCCTCCAGCTCCTGGCCGAGGAGGCGCAGGAGAACTCCAACATCAACC 

55240 + + + + + + 55299 

TTCCGGACCGGCCGGAGGTCGAGGACCGGCTCCTCCGCGTCCTCTTGAGGTTGTAGTTGG 
34 GLAGLQLLAEEAQENSNINL- 

TCGCCGCGGAGATCTGGCGGCTCGCGGCGGAATGCCTGATGCGGGCGCGCGGGAAGGTCC 

55300 + + + : + + + 55359 

AGCGGCGCCTCTAGACCGCCGAGCGCCGCCTTACGGACTACGCCCGCGCGCCCTTCCAGG 
34 AAEIWRLAAECLMRARGKVR- 

GCGGCGCCACCGGCGGCTGACGCCGCGCCGGTTCGCGAGGTCCACCGCGCCGCCGTGGCC 
55360 + + + + + + 55419 



69 





CGCCGCGGTGGCCGCCGACTGCGGCGCGGCCAAGCGCTCCAGGTGGCGCGGCGGCACCGG 

34 G A T G G * (ORF34) 

ACCGCCGTCGGCGTGAGGCGCCGGCGTGTGCCGCCCCCCACGGTTGCTCGCCCTTGGTGG 

55420 + + + + + + 55479 

TGGCGGCAGCCGCACTCCGCGGCCGCACACGGCGGGGGGTGCCAACGAGCGGGAACCACC 

TGCATCTGTTGGCACATGTGTACCTCCTACACAGTCAATTGTTGCCAAAATTGTCGAACC 

55480 + + + + + + 55539 

ACGTAGACAACCGTGTACACATGGAGGATGTGTCAGTTAACAACGGTTTTAACAGCTTGG 

GAATGGCAATTGCTTGCCTTTGCTGAAGAGGCGTGCTGATATGCAAGTCAAGTAGCCTCC 

55540 + + + + + + 55599 

CTTACCGTTAACGAACGGAAACGACTTCTCCGCACGACTATACGTTCAGTTCATCGGAGG 

TCCGATCTCGGGCGGCCATATGGGAAACATCGAGTTGAGCGGCGATGGCGTTCGTCAGTG 

55600 + + + + + + 55659 

AGGCTAGAGCCCGCCGGTATACCCTTTGTAGCTCAACTCGCCGCTACCGCAAGCAGTCAC 

CTGCCGTTCTGGCCAGGCAACTGATGTCGATGGGGATGGCAAGATTTTGCCGAAAACCGA 

55660 + -f + + + + 55719 

GACGGCAAGACCGGTCCGTTGACTACAGCTACCCCTACCGTTCTAAAACGGCTTTTGGCT 

TACATCTCTGTCCGTCCCGGACAGCCTTCGCCCCCCGGGTGACACTGCTCCGGCATGGCT 

55720 + + + + + + 55779 

ATGTAGAGACAGGCAGGGCCTGTCGGAAGCGGGGGGCCCACTGTGACGAGGCCGTACCGA 

CCGGTTTCTCGTCGCCCGGCCGACGGACCGCACCGTCCGGAACGAGGCGCCGGTGTGCGT 

55780 + + + + + + 55839 

GGCCAAAGAGCAGCGGGCCGGCTGCCTGGCGTGGCAGGCCTTGCTCCGCGGCCACACGCA 

CCGCTGATGGGCACAGCGGCCTCGGCCGCAGCAGGTTCCCACCGAGAAGAATGCCGAGGC 

55840 + + + + + + 55899 

GGCGACTACCCGTGTCGCCGGAGCCGGCGTCGTCCAAGGGTGGCTCTTCTTACGGCTCCG 

CCAGCCGTGAACCACGACATGTCCCAGCGTGCCTTGCTGGAGGCGGCGGCCGAGGGGCTG 

55900 + + + + + + 55959 

GGTCGGCACTTGGTGCTGTACAGGGTCGCACGGAACGACCTCCGCCGCCGGCTCCCCGAC 

CGGCGGCTGGCCGGCGACGCGCGGTGCCGGAGCGCGTCGGCCGCGCCCTCCTCGGCATTG 

55960 + + + + + + 56019 

GCCGCCGACCGGCCGCTGCGCGCCACGGCCTCGCGCAGCCGGCGCGGGAGGAGCCGTAAC 

AGGGACATGTTCTCCCCCGCCGCCCGCCGGTACGTGCTCGCCTCGGACCGCGCGGGGTTC 

56020 + + + + + + 56079 

TCCCTGTACAAGAGGGGGCGGCGGGCGGCCATGCACGAGCGGAGCCTGGCGCGCCCCAAG 

35 (ORF35) MFSPAARRYVLASDRAGF 

TTCGAGCAGGCTGTCCGGCTGCGCTCCCGGGGGTACCGGGTGAGCGCGGAGTTCGTCGGC 

56080 + + + + + + 56139 

AAGCTCGTCCGACAGGCCGACGCGAGGGCCCCCATGGCCCACTCGCGCCTCAAGCAGCCG 
35 FEQAVRLRSRGYRVSAEFVG 

CCCGATCAGGGAGCCACCGACGCCCTCCACGCGGAGCACGTGGTCGAAGAGCACCTGAGG 

56140 + + + + + + 56199 

GGGCTAGTCCCTCGGTGGCTGCGGGAGGTGCGCCTCGTGCACCAGCTTCTCGTGGACTCC 
35 PDQGATDALHAEHVVEEHLR 

CTGCTCGATCAGGAGCCGGCCCCTGACCGGATCGGTGTGGACGTCTCCCGGATCGGCCTC 

56200 + + + + + + 56259 

GACGAGCTAGTCCTCGGCCGGGGACTGGCCTAGCCACACCTGCAGAGGGCCTAGCCGGAG 
35 LLDQEPAPDRIGVDVSRIGL 

GCCCACTCGGCGCAGACTGCCCTGCGCAACACCGGGCGGCTGGCTGCCGCTGCGGCGCTC 

56260 + + + + + + 56319 

CGGGTGAGCCGCGTCTGACGGGACGCGTTGTGGCCCGCCGACCGACGGCGACGCCGCGAG 
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35 AHSAQTALRNTGRLAAAAAL 

CGCGGGAGCGAGGTCGTCCTGCTCATGGAGGGGTCCGAGGACATCGACACCGTGCTGGCC 

56320 + + + + + + 56379 

GCGCCCTCGCTCCAGCAGGACGAGTACCTCCCCAGGCTCCTGTAGCTGTGGCACGACCGG 
35 RGSEVVLLMEGSEDIDTVLA 

GTCCATGACGCCCTGGTGAACCGTTACGACAACGTGGGGATCACCCTTCAGGCGCACCTG 

56380 + + + + + + 56439 

CAGGTACTGCGGGACCACTTGGCAATGCTGTTGCACCCCTAGTGGGAAGTCCGCGTGGAC 
35 VHDALVNRYDNVGITLQAHL 

CACCGCACCGTGGACGACGCCATGGCGGTCGCGGGTCCTGGCCGCACCGTGCGGCTGGTC 

56440 + + + + + + 56499 

GTGGCGTGGCACCTGCTGCGGTACCGCCAGCGCCCAGGACCGGCGTGGCACGCCGACCAG 
35 HRTVDDAMAVAG PGRTVRLV 

ATGGGCTCCTCGGCCGAGCCTGCCGGCACCGCTCTGTCCCGGGGCCCCGCTCTGGAGGAC 

56500 + + + + + + 56559 

TACCCGAGGAGCCGGCTCGGACGGCCGTGGCGAGACAGGGCCCCGGGGCGAGACCTCCTG 
35 MGSSAE PAGTALSRGPALED 

CGGTACCTTGACCTCGCGGAGCTTCTCGTGGACCGTGGCGTCCGGCTGAGTCTGGCCACT 

56560 + + + + + + 56619 

GCCATGGAACTGGAGCGCCTCGAAGAGCACCTGGCACCGCAGGCCGACTCAGACCGGTGA 
35 RYLDLAELLVDRGVRLSLAT 

CCGGACGCCGAGGTCCTGGCCGGGGCGCAGGAGCGTGGTCTGCTCGAACGCGTCCAGGAC 

56620 + + + + + + 56679 

GGCCTGCGGCTCCAGGACCGGCCCCGCGTCCTCGCACCAGACGAGCTTGCGCAGGTCCTG 
35 PDAEVLAGAQERGLLERVQD 

ATCGAGATGCTCTACGGTGTGCGGCCCGAGCTGCTGCGCCGCCACCGGGCGGCGGGCCGC 

56680 + + + + + + 56739 

TAGCTCTACGAGATGCCACACGCCGGGCTCGACGACGCGGCGGTGGCCCGCCGCCCGGCG 
35 I EMLYGVRPELLRRHRAAGR 

CCCTGTCGCATCCACGCGGCCTACGGGATGAACTGGTGGCTTCCCCTGCTGCGGAGGCTG 

56740 + + + + + + 56799 

GGGACAGCGTAGGTGCGCCGGATGCCCTACTTGACCACCGAAGGGGACGACGCCTCCGAC 
35 PCRIHAAYGMNWWLPLLRRL 

GCCGACAACCCGCCGATGGTGCTCAACGCCCTGGCCGACATCGGCCGGGACCGGGAGCCC 

56800 + + + + + + 56859 

CGGCTGTTGGGCGGCTACCACGAGTTGCGGGACCGGCTGTAGCCGGCCCTGGCCCTCGGG 
35 ADNPPMVLNALADIGRDREP 

GTCGCCCACCAGGCGTACTGACCCGCCCCGGGCCGCGATCCGCGGGGCACCGGCCCCGGG 

56860 + + + + + + 56919 

CAGCGGGTGGTCCGCATGACTGGGCGGGGCCCGGCGCTAGGCGCCCCGTGGCCGGGGCCC 

35 VAHQAY* (OF35) 

GCGCCGGTCAGCTCCCGGTCGCCGCGAACTGCCCGGGCCTGCGCCCCTCGCCCGCCGGCC 

56920 + + + + + + 56979 

CGCGGCCAGTCGAGGGCCAGCGGCGCTTGACGGGCCCGGACGCGGGGAGCGGGCGGCCGG 

36 (ORF36) * SGTAAFQG PRRGEGAP 

CCCGGTAGGCCTGGGCGATGTCCAGCCACTTCTCCGCCTCCTGACCAGACGCGGTCAGGG 

56980 + + + + + + 57039 

GGGCCATCCGGACCCGCTACAGGTCGGTGAAGAGGCGGAGGACTGGTCTGCGCCAGTCCC 
36 GRYAQAIDLWKEAEQGSATL 

CGAGGTCGTCGCGGTGGCGGCGCCGGGTGACCAGCAGGCAGAAGTCGTGCGCGGGACCGC 

57040 + + + + + + 57099 

GCTCCAGCAGCGCCACCGCCGCGGCCCACTGGTCGTCCGTCTTCAGCACGCGCCCTGGCG 
36 ALDDRHRRRTVLLCFDHAPG 
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TGACCGTCTCGGTGGCGTCCTCGGGGCCGACCGTCCAGACCTCGCCCGAGGGGGCGGTGA 

57100 + + + + + + 57159 

ACTGGCAGAGCCACCGCAGGAGCCCCGGCTGGCAGGTCTGGAGCGGGCTCCCCCGCCACT 
36 SVTETA DE PGVTWVEGS PAT 

GCTCGAAGCGGAACGGCGCGGCCGGCGGGGTCAGACCGTGGGACTCGTAGCCGAAGTCGC 

57160 + + + + + + 57219 

CGAGCTTCGCCTTGCCGCGCCGGCCGCCCCAGTCTGGCACCCTGAGCATCGGCTTCAGCG 
36 LEFRFPAAPPTLGHSEYGFD 

GTGTCAGCCAGGCGAAGTCGACGATGTTGCGAAGCCGCTCGGTGGGCGTGCGCCGGACAC 

57220 + + + + + + 57279 

CACAGTCGGTCCGCTTCAGCTGCTACAACGCTTCGGCGAGCCACCCGCACGCGGCCTGTG 
36 RTLWAFDVINRLRETPTRRV 

CCAGGGCGTCGGCGACGTCCTGGCCGTGGGCGAACACCTCCATGATCCCGGCGCAGCCCA 

57280 + + + + + + 57339 

GGTCCCGCAGCCGCTGCAGGACCGGCACCCGCTTGTGGAGGTACTAGGGCCGCGTCGGGT 
36 GLADAVDQGHAFVEM I GACG 

GAACGACCGGCGGCAGCGGGTTGACCAGCCACGGAACCACCTGGCCGGCGGGGACCGCGG 

57340 + + + + + + 57399 

CTTGCTGGCCGCCGTCGCCCAACTGGTCGGTGCCTTGGTGGACCGGCCGCCCCTGGCGCC 
36 LVVPPLPNVLWPVVQGAPVA 

CGAGCGCCTCGACCGAGGCCCGCCCCATGCCCCGGAAGCGGGTGAGCAGTTCCTGCGGCG 

57400 + + + + + + 57459 

GCTCGCGGAGCTGGCTCCGGGCGGGGTACGGGGCCTTCGCCCACTCGTCAAGGACGCCGC 
36 ALAEVSARGMGRFRTLLEQP 

GGAAGCCCTTGAACTGCTGCAGAGCCGCGTTGACCGCTCCGTCGAAGTTGCCTGCCGCGG 

57460 + + + + + + 57519 

CCTTCGGGAACTTGACGACGTCTCGGCGCAACTGGCGAGGCAGCTTCAACGGACGGCGCC 
36 PFGKFQQLAANVAGDFNGAA 

CGGCCGTGACGGCCTTGAACTCCTCCGGCGCCGCCGCCGCGGTCCTGGCCAGGTTGAAGA 

57520 + + + + + + 57579 

GCCGGCACTGCCGGAACTTGAGGAGGCCGCGGCGGCGGCGCCAGGACCGGTCCAACTTCT 
36 AATVAKFE E PAAAATRALNF 

CGAAGGTGAGGTGGGCGATCTGGTCGGTGACGGTCCAGCCGGGCGCCGGCGTCGGAGTGT 

57580 + + + + + + 57639 

GCTTCCACTCCACCCGCTAGACCAGCCACTGCCAGGTCGGCCCGCGGCCGCAGCCTCACA 
36 VFTLHAIQDTVTWGPAPTPT 

TCCAGGCTTCGTCGTCGATCTTCTCGACCAGCTGCGCCAGCTCCTCGATGTCGGTGGCCA 

57640 + + - + + + + 57699 

AGGTCCGAAGCAGCAGCTAGAAGAGCTGGTCGACGCGGTCGAGGAGCTACAGCCACCGGT 
36 NWAEDDI KEVLQALEE IDTA 

GGTGCTTGAGGACGTCGTCGAGCGAATTCATCTCGTACTTCCTTCACTGGGGGTGTTCCG 

57700 + + + + + + 57759 

CCACGAACTCCTGCAGCAGCTCGCTTAAGTAGAGCATGAAGGAAGTGACCCCCACAAGGC 
36 LHKLVDDLSNM (ORF36) 

GGCTGGGACGGATGTCCCGCCGGGTGGGCCGGCGGCCGGCGGAAGCGCCGTCGCGGAGCG 

57760 + + + + + + 57819 

CCGACCCTGCCTACAGGGCGGCCCACCCGGCCGCCGGCCGCCTTCGCGGCAGCGCCTCGC 

TCGGCGACAGTCGCTAGGCGGCGCGTCCCGCGTAGGAGCCGGCCCGGTCGGAATAGGGCG 

57820 + + + + + + 57879 

AGCCGCTGTCAGCGATCCGCCGCGCAGGGCGCATCCTCGGCCGGGCCAGCCTTATCCCGC 
37 (ORF37) *AARGAYSGARDSYP 

CGAGCGCCTCGGCCAGGGCTTCGGGTATCAGGGTCGGCACGGTCGCCGTGTTGGGGCCGC 
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57880 + + + + + ^ 

GCTCGCGGAGCCGGTCCCGAAGCCCATAGTCCCAGCCGTGCCAGCGGCACAACCCCGGCG 

ALAEALAE P I L T PVTATN PG 

GCATGCAGGCGATGCGCTGGCGTCCCCGCGCCACCAGGGTCTCGCCGCCGTCGTCGCCCA 

57940 + + + + + + 57999 

CGTACGTCCGCTACGCGACCGCAGGGGCGCGGTGGTCCCAGAGCGGCGGCAGCAGCGGGT 
RMCAI RQRGRAVLTEGGDDG 

GCTTGATGTAGTCGAAGGTGAACTCCAGCTGGGTCTGCCGCAGCTCCGAGAGCCTCATCC 

58000 + + + + + + 58059 

CGAACTACATCAGCTTCCACTTGAGGTCGACCCAGACGGCGTCGAGGCTCTCGGAGTAGG 
LKIYDFTFELQTQRLESLRM 

GGATCGACAGTTCGTCGAAGGCGGTGATCTCCGCGAAGAACTCGCAGTCCACCTTGAGGG 

58060 + + + + + + 58119 

CCTAGCTGTCAAGCAGCTTCCGCCACTAGAGGCGCTTCTTGAGCGTCAGGTGGAACTCCC 
RISLEDFATIEAFFECDVKL 

TGAAGAGCTTGAGGTCCTCCTGGACCTCGGCGAGCACCGAAGGCGCCCTCTCCTTGAGAA 

58120 + + + + + + 58179 

ACTTCTCGAACTCCAGGAGGACCTGGAGCCGCTCGTGGCTTCCGCGGGAGAGGAACTCTT 
T F L KLD E QVEAL V S PAR E KL 

AGAGTTCCCGGCAACGCCCCTGCCAACGAAGGTAGTTGACGTAGTAGACGTTGCCGACGA 

58180 + + + + + + 58239 

TCTCAAGGGCCGTTGCGGGGACGGTTGCTTCCATCAACTGCATCATCTGCAACGGCTGCT 
FLERCRGQWRLYNVYYVNGV 

GGTTCGTCTCCTCGAAGCCGACGGTGTGGCGGAGCTCGAAGTAGTCAGGATTCGTCGCGG 

58240 + + + + + + 58299 

CCAAGCAGAGGAGCTTCGGCTGCCACACCGCCTCGAGCTTCATCAGTCCTAAGCAGCGCC 
L NTEE FGVTHRLE FYD PNTA 

TCATAGGTCTGTGCCCTTCGTCGTCGGGGCCGGTCGTCGCACCGAGTTGCGTGAAGCAAC 

58300 + + + + + + 58359 

AGTATCCAGACACGGGAAGCAGCAGCCCCGGCCAGCAGCGTGGCTCAACGCACTTCGTTG 

T M (ORF37) 

TCACTGGTCGCGATGGCCTGCGGGGTCGGTGGCCCGCGCTCCGGGCGGAGAGTGCGGGCG 

58360 + + + + + + 58419 

AGTGACCAGCGCTACCGGACGCCCCAGCCACCGGGCGCGAGGCCCGCCTCTCACGCCCGC 

GGGTGCCGGCCGGCGCGGGGTCAGCCGCGCGCCGACGGCAGCAGGGGAAGAACCCTCTCG 

58420 + + + + + + 58479 

CCCACGGCCGGCCGCGCCCCAGTCGGCGCGCGGCTGCCGTCGTCCCCTTCTTGGGAGAGC 
(ORF3 8) *GRAS PLLPLVRE 

CGGCCGCTCGTGGAGCCGTCGGGGGCCGGTGCGCCGTAGGTGACGGAGATACCCCGGCTC 

58480 + + + + + + 58539 

GCCGGCGAGCACCTCGGCAGCCCCCGGCCACGCGGCATCCACTGCCTCTATGGGGCCGAG 
RGSTSGDPAPAGYTVS IGRS - 

TGCGCGGCGCGCACGATCCCCGGCATCGCGCGTTCGGCGAGCGCCGCGATGGTCATCGCG 

58540 + + + + + + 58599 

ACGCGCCGCGCGTGCTAGGGGCCGTAGCGCGCAAGCCGCTCGCGGCGCTACCAGTAGCGC 
QAARVIGPMAREALAAI TMA- 

GGATTGACCGTCAGCGCGCCGGGAACCGACGATCCGTCGGTGACGAAGATCCCCGGGTGG 

58600 + + + + + + 58659 

CCTAACTGGCAGTCGCGCGGCCCTTGGCTGCTAGGCAGCCACTGCTTCTAGGGGCCCACC 
PNVTLAGPVSSGDTVFIGPH- 

TCGCGGAGCTCGTTGCTGTCGTCCAGGGCGGATGTGTGGGGGTCGTCGCCCATCCGGCAG 

58660 + i I h 1 1 58719 

AGCGCCTCGAGCAACGACAGCAGGTCCCGCCTACACACCCCCAGCAGCGGGTAGGCCGTC 
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38 DRLE NSDDLASTHPDDGMRC- 

GAGGAGAGCGGGTGGACGGTGTAGGCGCCGACGAGGTCGTTGGTCCAGGGCATGACCTTG 

58720 + + + + + + 58779 

CTCCTCTCGCCCACCTGCCACATCCGCGGCTGCTCCAGCAACCAGGTCCCGTACTGGAAC 
38 SSLPHVTYAGVLDNTWPMVK- 

GCCAGGCCGTCCTTCTCCAGGATCTCCTTGACCTCGGCGTCGGATGCGGCCCAGGCGCCC 

58780 + + + + + + 58839 

CGGTCCGGCAGGAAGAGGTCCTAGAGGAACTGGAGCCGCAGCCTACGCCGGGTCCGCGGG 

38 ALGDKELI EKVEADSAAWAG 

AGGGTGTTCTTCGTCGGGTCGTAGCGCAGGTTGCCCCGGCCGAGCATCTGCTGGGAGATG 

58840 + + + + + + 58899 

TCCCACAAGAAGCAGCCCAGCATCGCGTCCAACGGGGCCGGCTCGTAGACGACCCTCTAC 
38 LTNKTPDYRLNGRGLMQQS I 

CGGTGGGCGTTACCGGTGGCGGGAGGGGGGCCGAAGACGCCTTCGTTGTCGTCCTCGATC 

58900 + + + + + + 58959 

GCCACCCGCAATGGCCACCGCCCTCCCCCCGGCTTCTGCGGAAGCAACAGCAGGAGCTAG 
38 RHANGTAPPPGFVGENDDEI- 

ATCGTGAAGATCGTGAGCCAGGAGGTCCACTGCTTCAGGATCTCCTTCTTCTCCTTGCCG 

58960 + + + + + + 59019 

TAGCACTTCTAGCACTCGGTCCTCCAGGTGACGAAGTCCTAGAGGAAGAAGAGGAACGGC 
38 MTFITLWSTWQKLIEKKEKG- 

AACCAGGAGGGGCCCGTGGCGCCGGGCACCTGGGCGAGGATCGTGCCGAGGCCCGGCGGG 

59020 + + + + + + 59079 

TTGGTCCTCCCCGGGCACCGCGGCCCGTGGACCCGCTCCTAGCACGGCTCCGGGCCGCCC 
38 FWS PGTAGPVQALITGLGPP 

AAGTAGAGCTGTTCCAGGGAGTAGCGGGAGTACTCGGGCAACGAGCCGTCCAGCCTGTCC 

59080 + + + + + + 59139 

TTCATCTCGACAAGGTCCCTCATCGCCCTCATGAGCCCGTTGCTCGGCAGGTCGGACAGG 
38 FYLQELSYRSYEPLSGDLRD- 

CAGCTCGCCACGGTGGGCCCCTTGCCGATCTGGTTGGCCGCGTAGGCGAGCCCGTCGCCC 

59140 + + + + + + 59199 

GTCGAGCGGTGCCACCCGGGGAACGGCTAGACCAACCGGCGCATCCGCTCGGGCAGCGGG 
38 WSAVTPGKGIQNAAYALGDG 

CGGTCCAGGCCGAACAGCTCGGCCGCCTTGGCCTCGTCGATGATGGCGGTGTTGAGCCGC 

59200 + + + + + + 59 259 

GCCAGGTCCGGCTTGTCGAGCCGGCGGAACCGGAGCAGCTACTACCGCCACAACTCGGCG 
38 RDL GFLEAAKAEDI IATNLR- 

TCGCCGTTGCCGGAGAAGTAGCGTCCGACCGCTCGTGGCATGGTGCCCAGGTGGGCCTCG 

59260 + + + + + + 59319 

AGCGGCAACGGCCTCTTCATCGCAGGCTGGCGAGCACCGTACCACGGGTCCACCCGGAGC 

38 EGNGS FYRGVARPMTGLHAE 

CTGCGCTGGAGGATCACCGGGGTCGCGCCCGCGCCGGCCGCCATCACCACGATCTTCGCC 

59320 + + + + + + 59379 

GACGCGACCTCCTAGTGGCCCCAGCGCGGGCGCGGCCGGCGGTAGTGGTGCTAGAAGCGG 
38 SRQLIVPTAGAGAAMVVIKA- 

TCGATGACGCCGCTGCCCGCCTGGAGGCGGTAGTCGTCGTCGTGCACGACGTTGTAGTGC 

59380 + + + + + + 59439 

AGCTACTGCGGCGACGGGCGGACCTCCGCCATCAGCAGCAGCACGTGCTGCAACATCACG 
38 EIVGSGAQLRYDDDHVVNYH- 

ACCCGGTAGGAGCCGTCGGGGGTGCGCGAGAGGTGCTGGACCTCGTGCAGCGGGCGGATG 

59440 + + + + + + 59499 

TGGGCCATCCTCGGCAGCCCCCACGCGCTCTCCACGACCTGGAGCACGTCGCCCGCCTAC 

38 VRYSGDPTRSLHQVEHLPRI- 
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CGCGCCCCATGGGCGATGGCGGCGGGCAGGTAGTTGACCAGCAAGGACTGCTTGGCCTCG 

59500 + + + + + + 59559 

GCGCGGGGTACCCGCTACCGCCGCCCGTCCATCAACTGGTCGTTCCTGACGAACCGGAGC 
38 RAGHAIAAPLYNVLLSQKAE- 

AAGCGGCAGCCGGCCATCATCCAGTTGCAGTTCACGCACTTGGTGTTGTCGATGGCGACG 

59560 + + + + + + 59619 

TTCGCCGTCGGCCGGTAGTAGGTCAACGTCAAGTGCGTGAACCACAACAGCTACCGCTGC 
38 FRCGAMMWNCNVCKTNDIAV- 

GCGAGGGGGTTGGCGGTGCGGCCGGCGTGGTTGCACGCCGCGGCCCACAGTCCGCCGGCG 

59620 + + + + + + 59679 

CGCTCCCCCAACCGCCACGCCGGCCGCACCAACGTGCGGCGCCGGGTGTCAGGCGGCCGC 
38 ALPNATRGAHNCAAAWLGGA- 

TAGCTCACGTCGTTCCAGTCCTGCCGGGTCACGGAGAGGGACTCCTCGACACGGTCGTAC 

59680 + + + + + + 59739 

ATCGAGTGCAGCAAGGTCAGGACGGCCCAGTGCCTCTCCCTGAGGAGCTGTGCCAGCATG 
38 YSVDNWDQRTVSLSEEVRDY- 

CAGGGGTCCAGGGTTTCGCGGCTCACCGCCTGCGGCCACATCCGGCGTCCTATGGACCCC 

59740 + + + + + 4- 59799 

GTCCCCAGGTCCCAAAGCGCCGAGTGGCGGACGCCGGTGTAGGCCGCAGGATACCTGGGG 
38 WPDLTERSVAQPWMRRGISG- 

TGCCGGTCGAAGACGAAGCGCGGGGCGCGGGGCATCGCGGCGAAGTAGACGACGCTGCCG 

59800 + + + + + + 59859 

ACGGCCAGCTTCTGCTTCGCGCCCCGCGCCCCGTAGCGCCGCTTCATCTGCTGCGACGGC 
38 QRDFVFRPARPMAAFYVVSG- 

CCGCCCACACAGTTCCCGCCGAGGATGCTCATGCCGTCCCCGACCGTGAAGTCGAACGCC 

59860 + + + + + + 59919 

GGCGGGTGTGTCAAGGGCGGCTCCTACGAGTACGGCAGGGGCTGGCACTTCAGCTTGCGG 
38 GGVCNGGLI SMGDGVTFDFA- 

CTCGTGTACGAGGAGCCGAGTTTGTAGTCGTGCTCGAACTCCTTGCTCTCCAGCCACGGC 

59920 + + + + + + 59979 

GAGCACATGCTCCTCGGCTCAAACATCAGCACGAGCTTGAGGAACGAGAGGTCGGTGCCG 
38 RTYSSGLKYDHEFEKSELWP- 

CCGCGTTCCAGGACGGTGACGTCGGCGCCCCCCGCCGCCAGGTGGTAGGCGGCGATGGCA 

59980 + + + + + + 60039 

GGCGCAAGGTCCTGCCACTGCAGCCGCGGGGGGCGGCGGTCCACCATCCGCCGCTACCGT 
38 GRELVTVDAGGAALHYAAIA- 

CCGCCGAATCCGCTGCCGATGACGAGGACGTCCGTGCGCTCGGCCGTGGTGCTCATGCGG 

60040 + + + + + + 60099 

GGCGGCTTAGGCGACGGCTACTGCTCCTGCAGGCACGCGAGCCGGCACCACGAGTACGCC 

38 GGFGSGIVLVDTREATTSM 

(ORF3 9) * A 

GGCTCCCGGTGGACGTGGTGTCGGGGTGGAGGCGGGCGAACTCACGCCCGTAGCTGTAAT 

60100 + + + + + + - 60159 

CCGAGGGCCACCTGCACCACAGCCCCACCTCCGCCCGCTTGAGTGCGGGCATCGACATTA 

39 PSGTSTTDPHLRAFERGYSY 

CCTTGAAGCGCCACAGGCCGTCGGCGTCCGGCATGCTCAGGCCCATGGCCTCCAGTCCCG 

60160 + + + + + + 60219 

GGAACTTCGCGGTGTCCGGCAGCCGCAGGCCGTACGAGTCCGGGTACCGGAGGTCAGGGC 
39 DKFRWLGDADPMSLGMAELG 

GATGGCCGTCCTCCATCGCCTGTGCCGTGTTGAGGTGCGCGGCCGAATCGAAGGCCATGT 

60220 + + + + + + 60279 

CTACCGGCAGGAGGTAGCGGACACGGCACAACTCCACGCGCCGGCTTAGCTTCCGGTACA 
39 PH GDEMAQATNLHAASDFAM 
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TGCAGAAGAGGGACAGCAGCACCCAGAACTCCTTCTCGGGGTGGCCTGGTGTCGTCAGCC 

60280 + + + + + + 60339 

ACGTCTTCTCCCTGTCGTCGTGGGTCTTGAGGAAGAGCCCCACCGGACCACAGCAGTCGG 
39 NCFLSLLVWFEKEPHGPTTL 

GCTGGATCAGCGCGGCCCGGTCCGGGTAGTCGAGCGCCACGAAGGGCGGGACCGTCGGGT 

60340 + + + + + + 60399 

CGACCTAGTCGCGCCGGGCCAGGCCCATCAGCTCGCGGTGCTTCCCGCCCTGGCAGCCCA 
39 RQILAARDPYDLAVFPPVTP 

CGGGAGCCAGGCGGCGCTCCGCCGCGTAGGCCAGCGCGTGCTCGTTCACCAGGCGCACCA 

60400 + + + + + + 60459 

GCCCTCGGTCCGCCGCGAGGCGGCGCATCCGGTCGCGCACGAGCAAGTGGTCCGCGTGGT 
39 DPALRREAAYALAHENVLRV 



GGTCGTCCAGACCCTCGTGGATGCCGGTCGCATCCCATTGCAGGAGCTCCAGGGCTCCCG 

60460 + + + + + + 60519 

CCAGCAGGTCTGGGAGCACCTACGGCCAGCGTAGGGTAACGTCCTCGAGGTCCCGAGGGC 
39 LDDLGEHIGTADWQLLELAG 

CCTGGACGGCGCCACCGCCGGTGGACACCCCCGCGATGGCCCGGTCGTCCGCGAAGCGCT 

60520 + + + + + + 60579 

GGACCTGCCGCGGTGGCGGCCACCTGTGGGGGCGCTACCGGGCCAGCAGGCGCTTCGCGA 
39 AQVAGGGTSVGAIARDDAFR 

TCTGGCCCGGCACGATCGTGTCCGCGTAGGCCTCCAGGGTCATGGTCCGGATATCGCCGG 

60580 + + + + + + 60639 

AGACCGGGCCGTGCTAGCACAGGCGCATCCGGAGGTCCCAGTACCAGGCCTATAGCGGCC 
39 KQGPVITDAYAELTM (ORF39) 

CCGGCGCCCCTCGCTCATTGTCGTCGCGCAACTCGCTCTCCATTCTCGCAGTCCGGAGTG 

60640 + + + + + + 60699 

GGCCGCGGGGAGCGAGTAACAGCAGCGCGTTGAGCGAGAGGTAAGAGCGTCAGGCCTCAC 

GGATGCCTTGTGGCGAGGAGAAAGCTAGGTTCGTTCGACCGGTTCAAGCAACTAGCCAAA 

60700 + + + + + + 60759 

CCTACGGAACACCGCTCCTCTTTCGATCCAAGCAAGCTGGCCAAGTTCGTTGATCGGTTT 



GTCGAGGCGACCTTGAAACCGACTCCACGGAGTTGGCGCGAAGCGGCGGATGGATTACAC 

60760 + + + + + + 60819 

CAGCTCCGCTGGAACTTTGGCTGAGGTGCCTCAACCGCGCTTCGCCGCCTACCTAATGTG 

GCGCGGGCGAGCGGCTCACTAGTCTGGCCGCACGGATGTCTTCATCACCTGCACGTGGAA 

60820 + + + + -- + + 60879 

CGCGCCCGCTCGCCGAGTGATCAGACCGGCGTGCCTACAGAAGTAGTGGACGTGCACCTT 



AAGCTTCTGCACGGGCACCGCATGTGGAAGTGAGCCCTGGTCTCATGTCTTGGGGGAAAC 

60880 + + + + + + 60939 

TTCGAAGACGTGCCCGTGGCGTACACCTTCACTCGGGACCAGAGTACAGAACCCCCTTTG 

GTGAAAAGTGACTCTGCCCAACGCGCCGTGGAGCGATCACGCCGTGTCGTACGGATCGAT 

60940 + + + + + + 60999 

CACTTTTCACTGAGACGGGTTGCGCGGCACCTCGCTAGTGCGGCACAGCATGCCTAGCTA 
40 VKSDSAQRAVERSRRVVRID 

(ORF4 0) 

GAACTCATTCCCGCCGATTCCCCGCGCCTGAACGGAATCGATCGTTCCCATGTGCAGCGC 

61000 + + + + + + 61059 

CTTGAGTAAGGGCGGCTAAGGGGCGCGGACTTGCCTTAGCTAGCAAGGGTACACGTCGCG 
40 ELIPADSPRLNGIDRSHVQR 

CTCGCGACCGTGTACGCGTCCCTGCCGCCGGTCCTGGTGCACCGCCCGACCATGCGGGTC 

61060 + + + + + + 61119 

GAGCGCTGGCACATGCGCAGGGACGGCGGCCAGGACCACGTGGCGGGCTGGTACGCCCAG 
40 LATVYASLPPVLVHRPTMRV 
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GTCGACGGCATGCACCGCATCGGCGCGGCCCGCCTGAAGGGGCTGGACACGGTCGAGGTC 

61120 + + + + + + 61179 

CAGCTGCCGTACGTGGCGTAGCCGCGCCGGGCGGACTTCCCCGACCTGTGCCAGCTCCAG 
40 VDGMHRIGAARLKGLDTVEV 

ACCTTCTTCGAGGGCGCCGAGGAGCAGGTGTTCCTGCGTTCCGTCGCGGCGAACATCACC 

61180 + + + --- + + + 61239 

TGGAAGAAGCTCCCGCGGCTCCTCGTCCACAAGGACGCAAGGCAGCGCCGCTTGTAGTGG 
40 TFFEGAEEQVFLRSVAANIT 

AACGGCCTGCCGTTGTCGGTGGCCGACCGCAAGACCGCCGCGGCCCGCATTCTGGCCTCC 

61240 + + + + + + 61299 

TTGCCGGACGGCAACAGCCACCGGCTGGCGTTCTGGCGGCGCCGGGCGTAAGACCGGAGG 
40 NGLPLSVADRKTAAARI LAS 

CACCCGACCCTGTCCGACCGCGCGGTCGCCGCACACGTCGGCCTCGACGCCAAGACCGTG 

61300 + + + + + + 61359 

GTGGGCTGGGACAGGCTGGCGCGCCAGCGGCGTGTGCAGCCGGAGCTGCGGTTCTGGCAC 
40 HPTLSDRAVAAHVGLDAKTV 

GCGGGGGTACGGACGTGTTCAGCCGCGGGTTCTCCGCTGCTGAACATGCGCACCGGGGCG 

61360 + + + + + + 61419 

CGCCCCCATGCCTGCACAAGTCGGCGCCCAAGAGGCGACGACTTGTACGCGTGGCCCCGC 
40 AGVRTCSAAGS PLLNMRTGA 

GACGGCCGCGTCCACCCGTTGGACCGCACCGCCGAACGCCTGCACGCGGCCGCGCTGCTG 

61420 + + + + + + 61479 

CTGCCGGCGCAGGTGGGCAACCTGGCGTGGCGGCTTGCGGACGTGCGCCGGCGCGACGAC 
40 DGRVHPLDRTAER .LHAAALL 

ACCCAGGACCCGGGACTCCCGTTGCGCTCCGTCGTCGAGCAGACGGGGCTGTCGCTGGGC 

61480 + + + + + + 61539 

TGGGTCCTGGGCCCTGAGGGCAACGCGAGGCAGCAGCTCGTCTGCCCCGACAGCGACCCG 
40 TQDPGLPLRSVVEQTGLSLG 

ACGGCCCACGACGTCCGCCGTCGGCTGCTGCGGGGCGAGGACCCGGTCCCGCAGAACCGG 

61540 + + + + -. + + --■ 61599 

TGCCGGGTGCTGCAGGCGGCAGCCGACGACGCCCCGCTCCTGGGCCAGGGCGTCTTGGCC 
40 TAHDVRRRLLRGEDPVPQNR 

CAGAGCGCGATGCTGGAGCCGGGACTCGCCCCGCAGAAGAAGGCGACGGCCAAGCCGCCC 

61600 4- + + + + + 61659 

GTCTCGCGCTACGACCTCGGCCCTGAGCGGGGCGTCTTCTTCCGCTGCCGGTTCGGCGGG 
40 QSAMLEPGLAPQKKATAKPP 

GTCGGCCCGGCCGCCCGTCCGGTCCCGAAGGTGCCGCCCGCCGTCGCCGGCAGGCCGCCG 

61660 + + + + + + 61719 

CAGCCGGGCCGGCGGGCAGGCCAGGGCTTCCACGGCGGGCGGCAGCGGCCGTCCGGCGGC 
40 VGPAARPVPKVPPAVAGRPP 

GTGTCACCGCGGTCCCGGGCCCCGCTGGAGGCGCTGCGCAAGCTCTCCAACGACCCCTCC 

61720 + + + + + + 61779 

CACAGTGGCGCCAGGGCCCGGGGCGACCTCCGCGACGCGTTCGAGAGGTTGCTGGGGAGG 
40 VSPRSRAPLEALRKLSNDPS 

CTGCGCCACTCCGACCAGGGGCGCGAACTCATGCGCTGGCTGCACAACCGGTTCGTCGTC 

61780 + + + + + + 61839 

GACGCGGTGAGGCTGGTCCCCGCGCTTGAGTACGCGACCGACGTGTTGGCCAAGCAGCAG 
40 LRHSDQGRELMRWLHNRFVV 

GACGAGGCGTGGCGCCGGCGCGCGGACGCGGTCCCGGCCCACTGCGTCGACTCGATGGCG 

61840 + + + + + + 61899 

CTGCTCCGCACCGCGGCCGCGCGCCTGCGCCAGGGCCGGGTGACGCAGCTGAGCTACCGC 
40 DEAWRRRADAVPAHCVDSM A 
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GAGCTGGCGCAGCACTGCTCGGACGCCTGGCACCGGTTCGCCGAGGAGATGGTTCGGCGC 

61900 + + + + + + 61959 

CTCGACCGCGTCGTGACGAGCCTGCGGACCGTGGCCAAGCGGCTCCTCTACCAAGCCGCG 
40 ELAQHCSDAWHRFAEEMVRR 

CGGCACAGCGCCGCGGCCGACGGCTCCGGACTCCGCACGACTCAGCCAACTCGCCGTTGA 

61960 + + + + + + 62019 

GCCGTGTCGCGGCGCCGGCTGCCGAGGCCTGAGGCGTGCTGAGTCGGTTGAGCGGCAACT 

40 RHSAAADGSGLRTTQPTRR* 
(ORF40) - 

CGGCCTACTTCGACAGGGAGTTACGGTGACCACGAACACCATCGAGGACGCGGTCCGCCG 

62020 + + + + + + 62079 

GCCGGATGAAGCTGTCCCTCAATGCCACTGGTGCTTGTGGTAGCTCCTGCGCCAGGCGGC 
41 (0RF41) VTTNTIEDAVRR 

GGTCGTCGAGTACATGCACGTCAACCTGGGTCAGAACCTCACGATCGATGACATGGCGCG 

62080 + + + + + + 62139 

CCAGCAGCTCATGTACGTGCAGTTGGACCCAGTCTTGGAGTGCTAGCTACTGTACCGCGC 
41 VVEYMHVNLGQNLTI DDMAR 

CACGGCGATGTTCAGCAAGTTCCATTTCACCCGCATCTTCCGCGAAGTCACCGGTACCTC 

62140 + + + + + + 62199 

GTGCCGCTACAAGTCGTTCAAGGTAAAGTGGGCGTAGAAGGCGCTTCAGTGGCCATGGAG 

41 TAMFSKFHFTRI FREVTGTS 

TCCCGGGCGTTTCCTGTCCGCCTTACGGATTCAGGAGGCCAAGAGACTTCTCGTGCACAC 

62200 + + + + + + 62259 

AGGGCCCGCAAAGGACAGGCGGAATGCCTAAGTCCTCCGGTTCTCTGAAGAGCACGTGTG 
41 PGRFLSALRIQEAKRLLVHT- 

TGCACTCAGTGTGGCCGATATCAGCAGTCAGGTCGGCTACAGCAGTGTCGGTACTTTCAG 

62260 + + + + + + 62319 

ACGTGAGTCACACCGGCTATAGTCGTCAGTCCAGCCGATGTCGTCACAGCCATGAAAGTC 
41 ALSVADISSQVGYSSVGTFS- 

TTCTCGCTTCAAGGCCTGTGTGGGGCTTTCCCCGAGCGCCTATCGCGACTTCGGCGGGGT 

62320 + + + + + + 62379 

AAGAGCGAAGTTCCGGACACACCCCGAAAGGGGCTCGCGGATAGCGCTGAAGCCGCCCCA 
41 SRFKACVGLSPSAYRDFGGV- 

GCAGCCGGGTTTTCCCTCCGCCGCGGCCCGTCTCACTCCCACCGCGCACAATCCCTCCGT 

62380 + + + + + + 62439 

CGTCGGCCCAAAAGGGAGGCGGCGCCGGGCAGAGTGAGGGTGGCGCGTGTTAGGGAGGCA 
41 QPGFPSAAARLTPTAHNPSV- 

GCGCGGCCGCATTCACTCCGCCCCGGGTGACAGGCCCGGAAGGATCTTCGTGGGCCTGTT 

62440 + + + + + + 62499 

CGCGCCGGCGTAAGTGAGGCGGGGCCCACTGTCCGGGCCTTCCTAGAAGCACCCGGACAA 
41 RGRIHSAPGDRPGRI FVGLF- 

CCCCGGCAGGATGCGCCAGGGCCGCCCGGCGCGCTGGACCGTCATGGAGAGTCCCGGGGC 

62500 + + + + + + 62559 

GGGGCCGTCCTACGCGGTCCCGGCGGGCCGCGCGACCTGGCAGTACCTCTCAGGGCCCCG 
41 PGRMRQGRPARWTVMES PGA- 

CTTCGAGCTCCGGGACGTGCCCGTGGGCACCTGGCACATCCTGGTCCACTCCTTCCCCGC 

62560 + + + + + + 62619 

GAAGCTCGAGGCCCTGCACGGGCACCCGTGGACCGTGTAGGACCAGGTGAGGAAGGGGCG 
41 FEL RDVPVGTWH I L VHS FPA 

CGGACACCGGCCGCACCAGCTCGACTCCGAACCGCTGTTGCTCGGGCACAGCGGACCGCT 

62620 + + + + + + 62679 

GCCTGTGGCCGGCGTGGTCGAGCTGAGGCTTGGCGACAACGAGCCCGTGTCGCCTGGCGA 
41 GH RPHQLDSE PLLLGHSGPL 
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CGTGGTGCACCCCGGTGCCCTGCTCCGGCCGGCGGACATCCTCCTGCGCGCGGTGGACGC 

62680 + + + + + + 62739 

GCACCACGTGGGGCCACGGGACGAGGCCGGCCGCCTGTAGGAGGACGCGCGCCACCTGCG 
41 VVHPGALliRPADI LLRAVDA 

CCTCGATCCACCGGTCCTGtTGGCCCACTTCGCGCTGGAGAGCCGCCTCACCTCGCCGTA 

62740 + + + + + + 62799 

GGAGCTAGGTGGCCAGGACGACCGGGTGAAGCGCGACCTCTCGGCGGAGTGGAGCGGCAT 

41 LDPPVLLAHFALESRLTSPY 

42 (ORF42) * R A T - 

CTCACCGTCATCGGTAGCCCTCCGCGCATCCGCAGGGAGAGCATGGGTTCGGCAACCGCC 

62800 + + + + + + 62859 

GAGTGGCAGTAGCCATCGGGAGGCGCGTAGGCGTCCCTCTCGTACCCAAGCCGTTGGCGG 

41 SPSSVALRASAGRAWVRQPP- 

42 SVTMPLGGRMRLSLMPEAVA- 

CGGTGTCCGGCGACGGTACGCAGATCGAGATCGCGGGTGACCAGGGCCGTGACGAACACC 

62860 + + + + + + 62919 

GCCACAGGCCGCTGCCATGCGTCTAGCTCTAGCGCCCACTGGTCCCGGCACTGCTTGTGG 

41 GVRRRYADRDRG* (ORF41) 

42 RHGAVTRLDLDRTVLATVFV- 

GCCTCCATCATCCCGAGGTTGCTGCCGACGCAGAACCGGGGCCCCGCGCCGAACGGGATG 

62920 + + + + + + 62979 

CGGAGGTAGTAGGGCTCCAACGACGGCTGCGTCTTGGCCCCGGGGCGCGGCTTGCCCTAC 
42 AEMMGLNSGVCFRPGAGFPI - 

TACGCGTACCGCGGCCGGTCGGCGGTCTGCCGGGGTTCGAACCGCTCGGGGTCGAAGCGC 

62980 + + + + + + 63039 

ATGCGCATGGCGCCGGCCAGCCGCCAGACGGCCCCAAGCTTGGCGAGCCCCAGCTTCGCG 
42 YAYRPRDATQRPEFREPDFR- 

TCGGGGTCCTCCCACAGCCCCGGATGGCGGTGCATGATGTACGGGCAGACCAGCACATCC 

63040 + + + + + + 63099 

AGCCCCAGGAGGGTGTCGGGGCCTACCGCCACGTACTACATGCCCGTCTGGTCGTGTAGG 
42 EPDEWLGPHRHMIYPCVLVD- 

GATCCGGCGGACACCGTGTAGCCGCCGACCACATCGCGTTGCTGGGCCACCCTGGGCAGG 

63100 + + + + + + - 63159 

CTAGGCCGCCTGTGGCACATCGGCGGCTGGTGTAGCGCAACGACCCGGTGGGACCCGTCC 
42 S GAS VT YGGVVD RQQAVR PL 

ATCCC 

63160 + 63164 

TAGGG 
42 I G - 
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